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ALIGNMENTS 



RESULT 1 
AAE13596 

ID AAE13596 standard; Protein; 390 AA. 
XX 

AC AAE13596; 
XX 

DT 26-FEB-2002 (first entry) 
XX 

DE Porcine transforming growth factor beta 1 (TGF-betal) mutant. 
XX 

KW Porcine; transforming growth factor beta 1; TGF-betal; gene therapy; 

KW IBD; inflammatory bowel disease; autoimmune disease; immunosuppressive; 

KW multiple sclerosis; rheumatoid arthritis; systemic lupus erythematosus; 

KW diabetes mellitus; sarcoidosis; psoriasis; dermatological ; mutant; 

KW mutein. 
XX 

OS Sus scrofa. 
XX 

FH Key Location/Qualifiers 

FT Misc-dif f erence 223 

FT /note= "Wild type Cys substituted with Ser" 

FT Misc-diff erence 225 

FT /note= "Wild type Cys substituted with Ser" 

XX 

PN WO200181404-A2 . 
XX 

PD 01-NOV-2001. 
XX 

PF 20-APR-2001; 2001WO-US12 980 . 
XX 

PR 20-APR-2000; 2000US-199014P . 
XX 

PA (USSH ) US DEPT HEALTH & HUMAN SERVICES. 
XX 

PI Strober W, Nakamura K, Kitani A, Fuss IJ; 
XX 

DR WPI; 2002-026155/03. 

DR N-PSDB; AAD22696. 
XX 

PT Composition for treating autoimmune diseases e.g. inflammatory bowel 

PT disease in humans, comprises vector containing transforming growth 

PT factor-beta under the control of inducible promoter 
XX 

PS Example 1; Fig 1; 7 8pp; English. 
XX 

CC The invention relates to a composition containing a vector comprising a 

CC gene encoding a regulatory transcription factor under the control of a 

CC promoter encoding a transforming growth factor-beta (TGF-beta) . The 

CC vector is useful for expressing TGF-beta, such as TGF-betal, TGF-beta2 

CC or TGF-beta3, its variants or homologues, by transfecting a cell which 

CC is part of a host suspected of having an autoimmune disease, especially 

CC inflammatory bowel disease (IBD), under conditions such that the 

CC polypeptide encoded by the nucleic acid sequence in the vector is 

CC expressed. The vector is delivered using a delivery system. The delivery 

CC of the vector results in substantial elimination of symptoms of the 

CC autoimmune disease and increased production of IL-10 by the host. The 

CC composition is useful for treating various diseases with an autoimmune 



CC component such as multiple sclerosis, rheumatoid arthritis, systemic 

CC lupus erythematosus, insulin-dependent diabetes mellitus, sarcoidosis 

CC and psoriasis, and also for assaying the expression of a gene in a cell. 

CC The vector is further useful for screening of the effect of test 

CC compounds on cytokine (e.g. TGF-beta) expression of transfected cells. 

CC The present sequence is porcine TGF-betal mutant. 

XX 

SQ Sequence 3 90 AA; 

Query Match 95.9%; Score 2050.5; DB 23; Length 390; 

Best Local Similarity 96.8%; Pred. No. 7.6e-176; 

Matches 390; Conservative 0; Mismatches 0; Indels 13; Gaps 1; 

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 6 0 

IIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIII 
Db 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKT I DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

IIIIIIIIIIIMIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
Db 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

IIIIIIIIMIMIIIIIIIMIIMMIIIIIMIIIIIIIIIIIMIIIIIIIIMII 

Db 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHSSSDSKDNTLHVEI NGFN 24 0 

IIIIIIIIIIIIIIIIIIIIIIIIMIIIMIIIIMIIIIIIIIIIIIIIIIIIIIIII 
Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 3 00 

llllllllllllllllllllllllllllllllllllll MINIMI 
Db 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRR ALDTNYCFS 287 

Qy 301 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 360 

1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 288 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 347 

Qy 361 PGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 4 03 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 j i r i r 1 1 1 1 1 1 1 1 f r 1 1 1 1 1 f 1 1 1 1 1 1 1 

Db 348 PGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 390 



RESULT 2 
AAM39186 

ID AAM39186 standard; Protein; 390 AA. 
XX 

AC AAM39186; 
XX 

DT 22-OCT-2001 (first entry) 
XX 

DE Human polypeptide SEQ ID NO 2331. 
XX 

KW Human; nootropic; immunosuppressant; cytostatic; gene therapy; cancer; 

KW peripheral nervous system; neuropathy; central nervous system; CNS; 

KW Alzheimer's; Parkinson's disease; Huntington's disease; haemostatic; 

KW amyotrophic lateral sclerosis; Shy-Drager Syndrome; chemotactic; 

KW chemokinetic; thrombolytic; drug screening; arthritis; inflammation; 



KW leukaemia. 
XX 

OS Homo sapiens. 
XX 

PN WO200153312-A1 . 
XX 

PD 26-JUL-2001. 
XX 

PF 26-DEC-2000; 2 0 00WO-US342 63 . 
XX 

PR 21-JAN-2000; 2000US-0488725 . 

PR 25-APR-2000; 2000US-0552317 . 

PR 09-JUL-2000; 2 000US- 0598 042 . 

PR 19-JUL-2000; 2 0 0 OUS - 062 03 12 . 

PR 03-AUG-2000; 2 0 OOUS - 06 534 5 0 . 

PR 14-SEP-2000; 2 00 OUS- 0662 1 9 1 . 

PR 19-OCT-2000; 2000US-0693036 . 

PR 29-NOV-2000; 2 000US - 0727344 . 
XX 

PA (HYSE-) HYSEQ INC. 
XX 

PI Tang YT, Liu C, Asundi V, Chen R, Ma Y, Qian XB, Ren F, Wang D; 

PI Wang J, Wang Z, Wehrman T, Xu C ( Xue AJ, Yang Y, Zhang J; 

PI Zhao QA, Zhou P, Goodrich R, Drmanac RT; 
XX 

DR WPI; 2001-442253/47. 

DR N-PSDB; AAI58342. 
XX 

PT Novel nucleic acids and polypeptides, useful for treating disorders 

PT such as central nervous system injuries - 

XX 

PS Example 4; SEQ ID NO 2331; 10078pp ; English. 
XX 

CC The invention relates to human nucleic acids (AAI57798-AAI61369) and 

CC the encoded polypeptides (AAM38642 -AAM42213 ) with nootropic, 

CC immunosuppressant and cytostatic activity. The polynucleotides are useful 

CC in gene therapy. A composition containing a polypeptide or polynucleotide 

CC of the invention may be used to treat diseases of the peripheral nervous 

CC system, such as peripheral nervous injuries, peripheral neuropathy and 

CC localised neuropathies and central nervous system diseases, such as 

CC Alzheimer's, Parkinson's disease, Huntington's disease, amyotrophic 

CC lateral sclerosis, and Shy-Drager Syndrome. Other uses include the 

CC utilisation of the activities such as: Immune system suppression, 

CC Activin/inhibin activity, chemotactic/chemokinetic activity, haemostatic 

CC and thrombolytic activity, cancer diagnosis and therapy, drug screening, 

CC assays for receptor activity, arthritis and inflammation, leukaemias and 

CC C.N.S disorders. 

CC Note: The sequence data for this patent did not form part of the printed 

CC specification. 

XX 

SQ Sequence 3 90 AA; 

Query Match 90.2%; Score 1928.5; DB 22; Length 390; 

Best Local Similarity 90.8%; Pred. No. 7.2e-165; 

Matches 366; Conservative 10; Mismatches 14; Indels 13; Gaps 1; 



QY 



1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 6 0 



Db 


l 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


288 


Qy 


361 


Db 


348 



I II I 'II I II II I III I II I II I II II II I II I III ill I II ! 1 1 M 1 1 1 II 'II 

MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 
SPPSQGDVPPGPLPEAVIALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

'1 1 1 1 1 H 1 1 1 1 1 1 1'l I I I M, I I I M 'I I I I I I INI I II I I I M 1 1 1 M ^ hi 

SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 120 

YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180 

Mill : Ihll , II ' ll! 

YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 18 0 



I I I I I I M I I I I III I II I M I I M I I IM UNI I Ihllll hill 



SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 300 

M M M M 1 1 1 IM M M I M 1 1 M 1 1 1 1 II MMII MINIMI 

TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR ALDTNYCFS 287 



I h 1 1 Ml I h h 1 1 1 1 1 1 !l I Ml h I li i I h 1 1 1 Ml II 1 1 1 1 1 M 1 1 1 h 1 1 1 1 1 1 



PGASAAPCCVPQALE PLP I VYYVGRKP KVEQLSNM I VRS CKCS 4 03 

II II M M I II III II M 1 1 II Ml I II Ml 1 1 M III IM II 

PGASAAPCCVPQALE PLP I VYYVGRKPKVEQLSNM I VRSCKCS 3 90 



RESULT 3 
AAP61468 

ID AAP61468 standard; Protein; 390 AA. 
XX 

AC AAP61468; 
XX 

DT 31-OCT-2002 (updated) 

DT 28-OCT-1991 (first entry) 

XX 

DE PreTGF-beta gene product. 
XX 

KW Transforming growth factor beta; cancer; wound healing. 
XX 

OS Unidentified. 
XX 

FH Key Location/Qualifiers 

FT Protein 279 . .390 

XX 

PN EP200341-A. 
XX 

PD 10-DEC-1986. 
XX 

PF 21-MAR-1986; 86EP- 0302 1 12 . 
XX 

PR 22-MAR-1985; 85US-0715142 . 
PR 13-MAR-1987; 87US- 0025423 . 
XX 

PA (GETH ) GENENTECH INC. 
XX 



PI Derynck RMA; 
XX 

DR WPI; 1986-326875/50. 

DR N-PSDB; AAN60972 . 
XX 

PT TGF-beta prodn . from transformed hosts - useful esp. for treating 

PT wounds (J6 2/9/86) . 

XX 

PS Disclosure; Fig lb; 26pp; English. 
XX 

CC The gene product is known to stimulate cell proliferation and 

CC inhibit anchorage -dependent growth of a variety of human cancer cell 

CC lines, it is esp. useful in treatment of burns and the promotion of 

CC surface and internal wound healing. TGF-beta may be expressed from a 

CC transformed CHO cell line. 

CC (Updated on 31-OCT-2002 to add missing OS field.) 
XX 

SQ Sequence 390 AA; 

Query Match 8 9.8%; Score 1921.5; DB 7; Length 3 90; 

Best Local Similarity 90.6%; Pred. No. 3.1e-164; 

Matches 365; Conservative 10; Mismatches 15; Indels 13; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I M M M M II M I II 1 11 II M 1 1 1 1 1 1 II II 1 1 1 II M 1 1 II M I M II M I ! 1 1 1 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

1 1 1 M MM II I M I M I II MM I IM 1 1 M IIIIIMIIMIIIIIIIIMh hi 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Mill = Ihll I M, I I I I I I I I I I I I I, I I I ! I I I I II I I I II II ' I I I ' h I! I 
Db 121 YDKFKQSTHSI YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 18 0 

Qy 181 YLSNRLLAPSDS PEWLSFDVTG WRQWLTRREAI EGFRLSAHSSSDS KDNTLHVE I NGFN 240 

III Mill I III I II II I III II III I IH II II Ml II I Ihll II hllll 

Db 181 YLSNRLLAPSDS PEWLSFDVTGVVRQWLSRGGE I EGFRLSAHCSCDSRDNTLQVD I NGFT 240 



Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 3 00 

: 1 1 1 II IN II I M 1 1 1; I MM 1 1 II I II I 1 1 1 1 M 1 1 1 1 ; 1 1 [ i 

Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR ALDTNYCFS 28 7 

Qy 301 STEraCCWQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 360 

I M M I M MM M I M I II MM 1 1 II II II 1 1 M I M MM I M II I M I II M 1 1! 

Db 28 8 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 34 7 

Qy 361 PGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 4 03 

I II I M ll MM I II Ml I II I M MM 1 1 M 1 1 1 1 M I MM 

Db 34 8 PGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 3 90 



RESULT 4 
AAR04034 

ID AAR04034 standard; protein; 390 AA. 
XX 

AC AAR04034; 



XX 






DT 


25-MAR-2003 


(updated) 


DT 


31-OCT-2002 


(updated) 


DT 


31-MAY-1989 


(first entry) 


XX 






DE 


Sequence of 


pre-TGF-beta 1. 


XX 






KW 


Transforming growth factor beta-3 (TGF beta 3); tumour cells; growth 


KW 


inhibition . 




XX 






OS 


Homo sapiens 




XX 






FH 


Key 


Location/Qualifiers 


FT 


CDS 


348 . . BOO 


XX 






PN 


WO8912101-A. 




XX 






PD 


14-DEC-1989. 




XX 






PF 


08-JUN-1988; 


88WO-US01945. 


XX 






PR 


08-JUN-1988; 


88WO-US01945 . 


XX 






PA 


(GETH ) GENENTECH INC. 


XX 






PI 


Dernyck RMA, 


Goeddel DV ; 


XX 






DR 


WPI; 1990-007474/01. 


DR 


N-PSDB; AAQ02815. 


XX 






PT 


Nucleotide sequence encoding transforming growth factor beta-3 used as a 


PT 


probe, or to produce TGF beta 3, for inhibiting growth of certain normal 


PT 


and neoplastic cells, eg A549. 


XX 






PS 


Disclosure; 


Fig. 2; 61pp; English. 


XX 






CC 


Sequence is 


an exon of transforming growth factor-beta 1 (pre-TGF-beta 


CC 


1) polypeptide and corresponds to AA's 288-338 of mature TGF-beta 1. The 


CC 


nucleic acid 


I encoding second subtype of TGF-beta (TGF-beta 3) is useful 


CC 


as a probe or to produce TGF-beta 3 for inhibition of normal and 


CC 


neoplastic cell growth. 


CC 


(Updated on 


31-OCT-2002 to add missing OS field.) 


CC 


(Updated on 


25-MAR-2003 to correct PR field.) 


CC 


(Updated on 


25-MAR-2003 to correct PI field.) 


XX 






SQ 


Sequence 390 AA; 



Query Match 8 9.8%; Score 1921.5; DB 11; Lenqth 3 90; 

Best Local Similarity 90.6%; Pred. No. 3.1e-164; 

Matches 365; Conservative 10; Mismatches 15; Indels 13; Gaps 1; 



Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

I IIIIIIIIIMIII1IIIIIII IIIIMIIIIIIIMIIIMIIIIIIMIIIIIII 

Db 1 MPPSGLRLLPLLLPLLWLLVIjTPGPPAAGLSTCKT I DMELVKRKRI EAI RGQI LSKLRLA 60 



Qy 



61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

II III M MM 1 1 1 II I ii ' IT 1 1 1 , M ' MIIIIIMIIIIIIIIIIIIh hi 



Db 



61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 



Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180 

INN II Ml MMIMIMMMIMMMMIMMIMMMMIMMMM 

Db 121 YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

MMMMMMIMMMMMMIMM II Ml MM I MM Ml I Mill 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 24 0 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 3 00 

M M I II M I M I M I II II II I II II II II 1 1 M M IMIMIM 

Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR ALDTNYCFS 287 

Qy 301 STEKNCCVRQLYIDFRKI3LGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 360 

I M II II 1 1 1 II II M I M 1 1 II M I II I II 1 1 M II I M II I II M II II II M I M M 

Db 288 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPY1WSLDTQYSKVLALYNQHN 347 

Qy 3 61 PGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 4 03 

1 1 II M II M II 1 1 II II II I II II M 1 1 1 M I II II II M 1 1 

Db 348 PGASAAPCCVPQALEPLP I VY YVGRKPKVEQLSNMI VRSCKCS 390 



RESULT 5 




AAR05258 




ID 


AAR05258 standard; protein; 390 AA. 


XX 






AC 


AAR05258; 




XX 






DT 


25-MAR-2003 


(updated) 


DT 


05-AUG-1990 


(first entry) 


XX 






DE 


Human pre-transforming growth factor-beta- 1 (pre-TGF-beta- 1 ) 


XX 






KW 


Transforming < 


growth f actor-beta-1 (TGF-beta- 1 ) ; 


KW 


neoplastic cell line inhibition; 


KW 


EGF-potent iated anchorage- independent growth. 


XX 






OS 


Homo sapiens. 




XX 






FH 


Key 


Locat ion/Qua 1 if lers 


FT 


Peptide 


1 . .278 


FT 


Protein 


279. .2011 


FT 


Domain 


8 . .23 


FT 




/not e= "hydrophobic domain" 


FT 


Modif ied-site 


82. . 84 


FT 




/note="potent ial N-glycosylat ion site " 


FT 


Modif ied-site 


136. .138 


FT 




/note= M as above" 


FT 


Modif ied-site 


176 . . 178 


FT 




/note="as above" 


FT 


Cleavage-site 


277. .278 


FT 




/note="proteolytic cleavage site" 


XX 






PN 


US4886747-A. 




XX 






PD 


12-DEC-1989 . 





XX 

PF 13-MAR-1987; 87US - 0025423 . 
XX 

PR 13-MAR-1987; 87US-0025423 . 

PR 22-MAR-1985; 85US-0715142 . 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Derynck RMA, Goeddel DV; 
XX 

DR WPI; 1990-051338/07. 

DR N-PSDB; AAQ93301. 
XX 

PT Nucleic acid encoding transforming growth factor-beta - 

PT cloned into expression vectors for expression in eukaryotic host 

PT cells for therapeutic use 

XX 

PS Disclosure; Fig lb; 28pp; English. 
XX 

CC Pre-TGF-beta-1 is a polypeptide of 390 amino acids. Post- transitional 

CC cleavage of the precursor gives rise to the mature TGF-beta monomer. 

CC The sequence for human TGF-beta was determined by direct amino acid 

CC sequence analysis and by deduction from the TGF-beta cDNA. It is 

CC capable of inducing EGF-potentiated anchorage -independent growth of 

CC target cell lines, and/or growth inhibition of neoplastic cell lines. It 

CC can be used for treating wounds, eg burns or epidermal ulcers. 

CC (Updated on 25-MAR-2003 to correct PF field.) 

XX 

SQ Sequence 3 90 AA; 

Query Match 8 9.8%; Score 1921.5; DB 11; Length 3 90; 

Best Local Similarity 90.6%; Pred. No. 3.1e-164; 

Matches 365; Conservative 10; Mismatches 15; Indels 13; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 



Db 




Qy 



61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 



Db 




Qy 



121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 



Db 




Qy 



181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHSSSDSKDNTLHVEINGFN 240 



Db 



181 YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 24 0 



Qy 



241 SGRRGDLATI HGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 3 00 



Db 



241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR ALDTNYCFS 287 




Qy 301 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 360 

I I I I I I I I I I I I II II I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II 
Db 2 88 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 34 7 



Qy 361 PGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 403 

1 1 1 1 II I Ml MM 1 1 1 1 1 1 1 II 1 1 1 1 1 M 1 1 M 1 1 1 1 III I 

Db 348 PGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 390 



RESULT 6 
AAR13813 

ID AAR13813 standard; Protein; 390 AA . 
XX 

AC AAR13813; 
XX 

DT 20-NOV-1991 (first entry) 
XX 

DE Human pro-TGF-beta 1. 
XX 

KW Osteogenetic ; tumoricidal. 
XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 
FT Peptide 1 . . 29 

FT /note= "signal peptide" 

FT Peptide 30. .390 

FT /note= "pro-TGF-beta 1" 

FT Peptide 279.. 390 

FT /note= "TGF-beta 1" 

XX 

PN JP03180192-A. 
XX 

PD 06-AUG-1991 . 
XX 

PF 07-DEC-1989; 8 9 JP- 03 18243 . 
XX 

PR 07-DEC-1989; 8 9 JP- 03 18243 . 
XX 

PA (KIRI ) KIRIN BREWERY KK. 
XX 

DR WPI; 1991-271579/37. 
DR N-PSDB; AAQ13392. 
XX 

PT Human pro-TGF-beta 1 prodn., for osteo-genet ic activity - by 
PT preparing DNA chain contg. base sequence coding for human 
PT pre: pro-TGF-beta 1, forming expression vector etc. 
XX 

PS Claim 1; Fig 1; 16pp; Japanese. 
XX 

CC The amino acid sequence codes for human prepro-TGF-beta 1 which 
CC can be produced by recombinant methods, it has osteogenetic and 
CC tumoricidal activity. 
XX 

SQ Sequence 3 90 AA; 

Query Match 89.8%; Score 1921.5; DB 12; Length 390; 

Best Local Similarity 90.6%; Pred . No. 3.1e-164; 

Matches 365; Conservative 10; Mismatches 15; Indels 13; 



Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

I llllllllllllllllllllll 1 1 II M 1 1 1 1 1 !l ' III I : l I I I I I I II I I I I I 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

M 1 1 1 h 1 1 r 1 1 1 1 1 1, 1 1 1 : 1 1 1 1 1 M I I iMMMiiMiiiiiiiiih hi 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Mill : Ihll I M 1 1 1 1 1 1 1 M II 1 1 ! 1 1 1 M 1 1 1 1 II 1 1 M M I ! I M I M : 1 1 1 

Db 121 YDKFKQSTHSI YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHSSSDSKDNTLHVEINGFN 240 

Illllllllllllllllllllllllllhl lllllllll I Ihllll hllll 
Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 240 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 3 00 

h 1 1 1 1 II I h I ! 1 1 M 1 1 1 ' II 1 1 1 h I Mill II II M II I 

Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR ALDTNYCFS 2 87 

Qy 3 01 STEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDTQYSKVLALYNQHN 3 60 

Mill III II Ml III IMMIMIM III Mill IIIMIIII Mill II II MM III 

Db 2 88 STEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDTQYSKVLALYNQHN 347 

Qy 361 PGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 403 

I I I I I II I h h I I I hi I I ' II I I I h I I h I I I I I hli 
Db 348 PGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 390 



AAR73596; 

25-MAR-2003 
20-DEC-1995 



(updated) 
(first entry) 



RESULT 7 
AAR73596 

ID AAR73596 standard; Protein; 
XX 
AC 
XX 
DT 
DT 
XX 
DE 
XX 
KW 
KW 
KW 
XX 
OS 
XX 
PN 
XX 
PD 
XX 
PF 
XX 
PR 
PR 
PR 
PR 
XX 



390 AA. 



Human TGF-beta 1 protein. 

Transforming growth factor-beta; Human TGF-beta protein; TGF-beta 1; 
TGF-beta 3; osteogenic cell source; OCS; bone deficiency; 
bone- inducing cof actor. 

Homo sapiens. 

US5409896-A. 

25-APR-1995. 

12 -NOV- 1993; 93US - 0 1324 05 . 



01-SEP-1989; 
12-NOV-1991; 
18-MAY-1993; 
12-NOV-1993; 



89US-0401906 . 
91US-0790856. 
93US-0063841. 
93US-0132405. 



PA (GETH ) GENENTECH INC. 
XX 

PI Ammann AJ, Rudman CG; 
XX 

DR WPI; 1995-169610/22. 
XX 

PT Compsn. for treating skeletal tissue deficiency - comprising 

PT transforming growth factor-beta and an osteogenic cell source in a 

PT carrier 

XX 

PS Claim 3; Column 15-18; 19pp; English. 
XX 

CC This sequence represents human transforming growth factor-beta 1 

CC (TGF-beta 1) . This sequence and the sequence for human TGF-beta 3 (see 

CC AAR73598) are claimed within the scope of the invention. The invention 

CC is a composition consisting of a TGF-beta protein and an osteogenic cell 

CC source (OCS) formulated in an acceptable carrier other than a bone 

CC morphogenic cof actor. This composition can be used for the restoration 

CC of bone deficiency. This provides for the generation of mature bone 

CC only where it is required, without the inclusion of a specific 

CC bone- inducing cof actor. This method can be used with any of the 5 human 

CC TGF-beta • s or with TGF-beta from other species. 

CC (Updated on 25-MAR-2003 to correct PF field.) 

XX 

SQ Sequence 3 90 AA; 



Query Match 89.8%; Score 1921.5; DB 16; Length 3 90; 

Best Local Similarity 90.6%; Pred. No. 3.1e-164 ; 

Matches 365; Conservative 10; Mismatches 15; Indels 13; Gaps 1; 



Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I II M M 1 1 M II I! I ! M I M I 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

1 1 1. II 1 : 1 1 M i ' 1 1 ! 1 M . 1 1 1 1 M M 1 1 1 1 M 1 II II 1 1 1 II 1 1 1 1 M 1 1 1 1 : hi 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Mill : Ihll IIIMIIIIIIIIIIIIIIMIIIMIIIIIIIIIIIMIIhlll 

Db 121 YDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

IIMIIIIIIIMIIIIIIMIIIIIIhl MINIMI I 1 1 : 1 1 1 1 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEI EGFRLSAHCSCDSRDNTLQVDINGFT 240 

Qy 241 SGRRGDLAT I HGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 3 00 

: 1 1 1 1 M I II M I II 1 1 M M I II 1 1 1 1 1 1 1 llllll IMI ! MM 

Db 241 TGRRGDLATI HGMNR PFLLLMATPLERAQHLQSSRHRR ALDTNYCFS 287 

Qy 301 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 360 

II 1 1 1 1 II I II I M II II II M II 1 1 1 1 1 1 M M 1 1 II II I II I M I II II 1 1 1 M 1 1 II 

Db 288 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 347 

Qy 361 PGASAAPCCVPQALEPLP I VY YVGRKPKVEQLSNMI VRSCKCS 4 03 

1 1 1 1 1 1 If 1 1 1 1 1 1 1 1 ! I ! 1 1 1 1 1 1 M II 1 1 1 1 1 1 1 1 { I M 1 1 

Db 34 8 PGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 3 90 



RESULT 8 
AAR90827 

ID AAR90827 standard; Protein; 390 AA. 
XX 

AC AAR90827; 
XX 

DT 25-MAR-2003 (updated) 

DT 25-JAN-1980 (first entry) 

XX 

DE Pre-transforming growth factor beta 1. 
XX 

KW transforming growth factor beta 1; wound healing; 

KW recombinant production. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 
FT Domain 8 . .23 

FT /note= "hydrophobic domain" 

FT Modif ied-site 82.. 84 

FT /note= "potential N-glycosylat ion site" 

FT Modif ied-site 136.. 138 

FT /note= "potential N-glycosylat ion site" 

FT Modif ied-site 176.. 178 

FT /note= "potential N-glycosylat ion site" 

FT Cleavage-site 277.. 279 

FT /note= "trypsin-like peptidase cleavage site" 

FT Protein 279 . .390 

FT /label = mature_TGF_beta_l 

XX 

PN US5482851-A. 
XX 

PD 09-JAN-1996. 
XX 

PF 05-NOV-1993; 93US-0147364 . 
XX 

PR 13-MAR-1987; 87US- 0025423 . 
PR 22-MAR-1985; 8 5US- 0715142 . 
PR 04-AUG-1989; 8 9US - 03 8 9 92 9 . 
PR 04-MAR-1992; 92US - 084 58 93 . 
PR 05-NOV-1993; 93US- 0147364 . 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Derynck RMA, Goeddel DV; 
XX 

DR WPI; 1996-076891/08. 
DR N-PSDB; AAT15720. 
XX 

PT New recombinant human transforming growth factor-beta prods. - produced 
PT using Chinese hamster ovary cells, for use in diagnostic applications 
PT or in therapy 
XX 

PS Example 3; Fig 1A-C; 2 6pp ; English. 
XX 



CC The pre- trans forming growth factor (TGF) beta 1 protein is encoded 

CC by AAT15720. The mature TGF beta 1 monomer is cleaved from the 

CC precursor at the Arg-Arg dipeptide immediately preceding the mature 

CC TGF-beta 1 NH2- terminus . It does not contain a recognisable N-terminal 

CC signal peptide typical of most secreted proteins. The pre-TGF beta 1 

CC contains several pairs of basic residues which could undergo 

CC post-translational cleavage and give rise to separate polypeptide 

CC entities. The precursor contains 3 potential N-glycosylat ion sites, none 

CC of which are localised in the mature TGF beta 1. This is useful in 

CC purification of the mature protein. TGF beta 1 can be used in, e.g. wound 

CC healing. 

CC (Updated on 25-MAR-2003 to correct PF field.) 
XX 

SQ Sequence 390 AA; 

Query Match 89.8%; Score 1921.5; DB 17; Length 390; 

Best Local Similarity 90.6%; Pred. No. 3.1e-164; 

Matches 365; Conservative 10; Mismatches 15; Indels 13; Gaps 1 ; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I IIIIIIMMIIIMIMIIII IIIIIMIIMMIIIIMMMIIIIIIIIIIII 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

!: 1 1 1 h i m ill 1 1 m i n 1 1 ii 1 1 m 1 1 1 1 1 1 i i ii i ii irr 1 1 1 1 1 1 1 h hi 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Mill : Ihll IIMIMIIMIIIIMMIIIIIIIIIIIIIIIIIIIIIIhlll 

Db 121 YDKFKQSTHS I YMFFNTS ELREAVPE PVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240 

I I I M I I I I I I M I I I M I ' I II I I I I I-' I lllllllll I Ihllll hllll 
Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 24 0 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 3 00 

■ 1 1 , 1 1 1 ' 1 1 1 1 1 1 1 . 1 ' ! 1 1 1 1 I , I , I llllll lllllllll 

Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR ALDTNYCFS 287 

Qy 301 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 360 

1 1 1 1 1 1 II 1 1 1 1 1 ! 1 1 [ I II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 

Db 288 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 347 

Qy 3 61 PGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 403 

i: 1 1 M ! h I'll I Ml 1 1, 1 1 1 1 1 1 M 1 1 II 1 1. 1 'I I II I 

Db 34 8 PGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 3 90 



RESULT 9 
AAU77101 

ID AAU77101 standard; Protein; 390 AA. 
XX 

AC AAU77101; 
XX 

DT 05-JUN-2002 (first entry) 
XX 

DE Human transforming growth factor beta 1 (TFG-beta-1) polypeptide. 



XX 

KW Human; transforming growth factor beta; TGF-beta; insulin production; 

KW type I diabetes mellitus; pancreatic cell outgrowth; wound healing; 

KW pancreatic duct tissue; ischaemia; stroke; nervous system aging; 

KW neurological condition; neurodegenerative disease; inflammation; 

KW vasal injury; chemical injury; traumatic injury; tumour- induced injury ; 

KW amyotrophic lateral sclerosis; spinocerebellar degeneration; 

KW immunological disease; multiple sclerosis; TGF-beta- 1. 

XX 

OS Homo sapiens. 
XX 

PN WO200212336-A2. 
XX 

PD 14-FEB-2002. 
XX 

PF 09-FEB-2001; 2001WO-US04 192 . 
XX 

PR 09-AUG-2000; 2 000US- 0635368 . 
XX 

PA ( CUR I - ) CURIS INC. 
XX 

PI Wang M, Pang K; 
XX 

DR WPI; 2002-257468/30. 
XX 

PT Treating a subject with a disorder resulting from insufficient insulin 

PT production, and inducing outgrowth of pancreatic cells, involves using 

PT a transforming growth factor beta therapeutic 
XX 

PS Disclosure; Fig 1; 77pp; English. 
XX 

CC The invention relates to treating a subject with a disorder resulting 

CC from insufficient insulin production, involving contacting the subject 

CC with a transforming growth factor beta (TGF-beta) therapeutic. TGF-beta 

CC polypeptides can be used for treating a subject with a disorder resulting 

CC from insufficient insulin production, e.g. type I diabetes mellitus, and 

CC for inducing outgrowth of pancreatic cells associated with pancreatic 

CC duct tissue within a subject. A composition comprising a TGF-beta protein 

CC may be useful in wound healing and treatment of neurological conditions 

CC derived from acute, subacute or chronic injury to the nervous system, 

CC including traumatic injury, chemical injury, vasal injury and deficits 

CC (such as ischaemia resulting from stroke), together with 

CC infectious/inflammatory and tumour- induced injury, aging of the nervous 

CC system including Alzheimer's disease, chronic neurodegenerative diseases 

CC including Parkinson's disease, Huntington's chorea, amyotrophic lateral 

CC sclerosis, spinocerebellar degenerations and chronic immunological 

CC diseases of the nervous system or affecting the nervous system, including 

CC multiple sclerosis. This sequence represents the human TGF-beta-1 

CC protein. 

XX 

SQ Sequence 390 AA; 



Query Match 89.7%; Score 1918.5; DB 23; Length 390; 

Best Local Similarity 90.6%; Pred . No. 5.7e-164; 

Matches 365; Conservative 10; Mismatches 15; Indels 13; Gaps 1 ,- 



Qy 



1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRI EAIRGQI LSKLRLA 60 



Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


288 


Qy 


361 


Db 


348 



I Mlllll 1 1 1 Ml I Ml 1 1 1 II I M 1 1 1 1 1 Ml M Ml M M 1 1 1 M II 1 1 1 1 1 Ml 

MPPSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRI EAI RGQILSKLRLA 60 
SPPSQGDVPPGPLPEAVIALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

1 1 1 1 MM 1 1 1 1 1 Ml 1 1! 1 1 1 1 1 1 1 1 1 1 II i I I.I I III I ! 1 1 1 Ml 1 1 1 1 h hi 

SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 
YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Mill : M M I 1 1 1 1 1 1 1 1 1 M 1 1 r I I I I I I I I I I I I I I I I I I I I I II I I I I I ^ I I I 

YDKFKQSTHSI YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 18 0 
YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAI EGFRLSAHSSSDSKDNTLHVEI NGFN 24 0 

I Ml M M II Ml MM II II M MM Ml MINIMI I I Ml Ml MINI 

YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 24 0 
SGRRGDLiATI HGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 3 00 

^ 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IMMI IIIIIIMI 

TGRRGDLATI HGMNRPFLLLMATPLERAQHLQSSRHRR ALDTNYCFS 28 7 

STEKNCCWQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 36 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 M f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 

STEKNCCVRQLYIDFRKDLGWKWI HEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 34 7 

PGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 4 03 

I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
PGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 3 90 



RESULT 10 
AAE16943 

ID AAE16943 standard; Protein; 390 AA. 
XX 

AC AAE16943; 
XX 

DT 18-APR-2002 (first entry) 
XX 

DE Human transforming growth factor-betal (TGF-betal) protein. 
XX 

KW Human; transforming growth factor-betal; TGF-betal; osteoporosis; 

KW latency associated peptide; LAP; integrin alphavbeta3; apoptosis; 

KW immunomodulation; inflammatory disease; fibrotic disease; cancer; 

KW diabetic retinopathy; chronic obstructive pulmonary disorder; 

KW bone resorption; rheumatoid arthritis; psoriasis; restenosis; 

KW atherosclerosis; liver fibrosis; asthma; cytostatic; osteopathic; 

KW ophthalmol ogical ; antiarteriosclerot ic ; vasotropic . 
XX 

OS Homo sapiens. 



XX 

FH Key Location/Qualifiers 

FT Peptide 1. .29 

FT /label= Signal_peptide 

FT Region 30. .278 

FT /note= "LAP-betal" 

FT Domain 244 . . 246 

FT /note= "RGD motif" 

FT Protein 279. .390 

FT /note= "Human mature TGF-betal protein" 



XX 

PN WO200190186-A1 . 
XX 

PD 29-NOV-2001. 
XX 

PF 25-MAY-2001; 2001WO-GB02352 . 
XX 

PR 26-MAY-2000; 2000GB-0012991 . 

PR 05-JAN-2001; 2001GB-0000286 . 
XX 

PA (GLAX ) GLAXO GROUP LTD . 
XX 

PI Ludbrook S, Barry S, Horgan C, Miller D; 
XX 

DR WPI; 2002-097645/13. 
XX 

PT Identifying modulators of interactions between latency associated 

PT peptides and integrin alphavbeta3 for therapeutics, by contacting the 

PT peptide and integrin with a test product and determining if the product 

PT modulates interaction - 

XX 

PS Disclosure; Page 37-39; 44pp; English. 
XX 

CC The invention relates to a method for identification of a modulator 

CC of the interaction between latency associated peptide (LAP) of 

CC transforming growth factor-betal (TGF-betal) and integrin alphavbeta3 . 

CC The method is useful for identifying a modulator of the interaction 

CC between LAP and integrin alphavbeta3 . The method is useful for 

CC immunomodulation, in the treatment of inflammatory disease, fibrotic 

CC disease, cancer, diabetic retinopathy, bone resorption or osteoporosis, 

CC and for preventing apoptosis administering the modulator to the host. 

CC The modulator (inhibitor of the interaction between LAP-betal and 

CC integrin alphavbeta3) is useful in the manufacture of a medicament for 

CC immunomodulat ion . The modulator (activator of the interaction between 

CC LAP-betal and integrin alphavbeta3 ) is useful in the manufacture of 

CC medicament for preventing apoptosis. The modulator is useful for 

CC treating a inflammatory or fibrotic disease such as chronic obstructive 

CC pulmonary disorder, rheumatoid arthritis, psoriasis, restenosis, 

CC atherosclerosis, liver fibrosis and asthma. The present sequence is 

CC human TGF-betal protein. 

XX 

SQ Sequence 3 90 AA; 

Query Match 89.7%; Score 1918.5; DB 23; Length 390; 
Best Local Similarity 90.6%; Pred . No. 5.7e-164; 

Matches 365; Conservative 10; Mismatches 15; Indels 13; Gaps 1; 

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

I lllllll Ml I II M II II I! II Ml III I II I Mill I Ml M II 1 1 1 1 II I M I 

Db 1 MPPSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKT I DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

1 1 1 1 1 h 1 1 1 1 1 1 II ! 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 lllllllllllllllllllllh hi 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 



Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

INN : ||: II II I II M I II ! I Ml I 1 1 MM II III I I M II I III I I li: Ml 



Db 121 YDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

I M M II 1 1 1 M M II 11 M I M 1 1 1 1 1 U II II Ml!! I 1 1 1 1 1 1 I 1 1 1 1 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEI EGFRLSAHCSCDSRDNTLQVDINGFT 24 0 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 3 00 

MIIIIIIIIIIIIIIIMIIMIIIIIIII MINI MM MM 

Db 241 TGRRGDLAT1HGMNRPFLLLMATPLERAQHLQSSRHRR ALDTNYCFS 287 

Qy 3 01 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 288 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 347 

Qy 361 PGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 403 

I M M M I II II II M II I M I M M M M M I M II II M II 

Db 348 PGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 390 



RESULT 11 
ABB82780 

ID ABB82780 standard; Protein; 391 AA. 
XX 

AC ABB82780; 
XX 

DT 18-MAR-2003 (first entry) 
XX 

DE TGFB1 Arg25Pro polymorphism G-allele protein sequence. 
XX 

KW Polymorphism; transforming growth factor beta 1; TGFB1 ; TGFbeta-1; 

KW renal failure; nephrotropic; human; allele. 

XX 

OS Homo sapiens. 
XX 

PN WO200290585-A2 . 
XX 

PD 14-NOV-2002. 
XX 

PF 08-MAY-2002; 2002WO-GB02 066 . 
XX 

PR 09-MAY-2001; 2001GB- 001 1277 . 
XX 

PA (UYSH-) UNIV SHEFFIELD HALLAM . 
XX 

PI El-Nahas AM, Blakemore A, Khalil MS; 
XX 

DR WPI; 2003-120560/11. 
DR N-PSDB; ABV75391. 
XX 

PT Determining an individual's susceptibility to the progression of renal 
PT failure comprises detecting the presence of a genetic polymorphism 
PT pattern in transforming growth factor beta 1 (TGFB1) gene in a sample 
PT from the individual 
XX 

PS Claim 51; Page 59-61; 62pp; English. 
XX 

CC The invention relates to determining an individual's susceptibility to 



CC renal failure and invovles detecting the presence of a genetic 

CC polymorphism pattern in transforming growth factor beta 1 (TGFB1) gene in 

CC a sample from an individual , where polymorphism pattern is associated 

CC with renal failure. The method is useful for determining an individual's 

CC susceptibility to the progression of renal failure. The nucleic acid 

CC comprising a T(-509)C polymorphism of TGFB1 gene, or a polypeptide 

CC comprising a sequence of 391 amino acids is useful for preparing a 

CC medicament for retarding or preventing the progression of renal disease, 

CC and for drug research purposes for retarding or preventing the 

CC progression of renal disease. Sequences ABV75386-88 represents the 

CC protein sequence for the TGFB1 G-allele of the Arg25Pro polymorphism 

CC of exon 1. 

XX 

SQ Sequence 3 91 AA; 



Query Match 89.7%; Score 1918; DB 24; Length 391; 

Best Local Similarity 90.6%; Pred . No. 6.4e-164; 

Matches 366; Conservative 10; Mismatches 14; Indels 14; Gaps 2; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I II M I MM I Ml II M IM I II M II II M I II 1 1 1 I III I II 1 1 1 MMI M IM 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

Qy 61 S P PSQGDVP PG PLPEAVLALYNSTRDRVAGES VE PE PE PEADY YAKE VTRVLMVESGNQ I 12 0 

I I IM M III 1 1 II I II II I I M 1 1 M IM I 1 1 1 1 Mi I M 1 1 1 1 Ml 1 1 1'h Ml 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPE PEADY YAKE VTRVLMVETHNE I 120 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLL-RLKLKVEQHVELYQKYSNDSW 179 

MMI : MM M 1 1 MM I M II II II M IMM 1 1 M II M I M 1 1 M 1 1 M 1 1 

Db 121 YDKFKQSTHSI YMFFNTSELREAVPEPVLLSRAELRLLRRLKLKVEQHVELYQKYSNNSW 18 0 

Qy 180 RYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHSSSDSKDNTLHVEINGF 239 

1 1 M II MM M MM I M I M II I MM MM MINIMI I IMM MIMI 

Db 181 RYLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGF 24 0 

Qy 240 NSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCF 2 99 

= 1 1 1 1 1 1 1 1 1 1 1 1 1 If 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MIMI MIIIMI 

Db 241 TTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR ALDTNYCF 28 7 

Qy 300 SSTEKNCCVRQLY I DFRKDLGWKW I HE PKGYHANFCLGPCPYI WSLDTQYSKVLALYNQH 359 

II 1 1 II I M 1 1 |M IM M I 1 1 Ml II 1 1 IM I II 1 1 1 MMI 1 1 1 II I Ml III I IM 

Db 288 SSTEKNCCVRQLY I DFRKIDLGWRW I HEPKGYHANFCLGPCPY I WSLDTQYSKVLALYNQH 347 

Qy 3 60 NPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 4 03 

MM IM I MM I III II MIMI M I M II IM I II M I MM I 

Db 348 NPGASAAPCCVPQALEPLP1VYYVGRKPKVEQLSNMIVRSCKCS 391 



RESULT 12 
AAR20124 

ID AAR20124 standard; Protein; 390 AA. 
XX 

AC AAR20124; 
XX 

DT 25-MAR-2003 (updated) 

DT 16-APR-1992 (first entry) 

XX 



DE Sequence of simian transforming growth factor (TGF) beta-1. 
XX 

KW Hypertension therapy; hypotensive agent; blood pressure modulator. 
XX 

OS Monkey . 
XX 

FH Key Location/Qualifiers 

FT Peptide 8 . .21 

FT Protein 279 . .390 

XX 

PN W09119513-A. 
XX 

PD 26-DEC-1991. 
XX 

PF 20-JUN-1991; 91WO-US0444 9 . 
XX 

PR 20-JUN-1990; 9 OUS - 054 122 1 . 
XX 

PA (BRIM ) BRISTOL-MYERS SQUIBB CO. 
XX 

PI Oleson FB, Comereski CR; 
XX 

DR WPI; 1992-024199/03. 

DR N-PSDB; AAQ20289. 
XX 

PT Use of transforming growth factor (TGF) -beta and their 

PT antagonists - for modulating blood pressure, for treating 

PT hypertension and hypotension 
XX 

PS Disclosure; Fig 1; 42pp; English. 
XX 

CC A new method for treating hypertension comprises administering a 

CC transforming growth factor (TGF) -beta to an individual at a dose 

CC effective for lowering blood pressure; the TGF-beta may be e.g. 

CC mature TGF-beta, TGF-beta2, a mature TGF-betal/beta2 hybrid, TGF- 

CC betal precursor, a latent TGF-beta2 precursor, hybrid TGF -beta 1 /TGF - 

CC beta2 precursor, a latent TGF-betal complex or a latent TGF-beta2 

CC complex. 

CC (Updated on 25-MAR-2003 to correct PA field.) 
XX 

SQ Sequence 390 AA; 

Query Match 89.6%; Score 1916.5; DB 13; Length 390; 

Best Local Similarity 90.3%; Pred. No. 8.7e-164; 

Matches 364; Conservative 10; Mismatches 16; Indels 13; Gaps 1; 

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRIA 60 

I lllllllllllllllllllll ' I II I I I I I , I I Tl II I I II i lilllllllMI 
Db 1 MPPSGLRLLPLLLPLLWLLVX.TPSRPAAGLSTCKTIDMELVKRKRIETIRGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

' 1 I I Mill hi 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Mill = I h II II II II 1 1 II II M M II II I II 1 1 1 II I M II M II II II h II I 

Db 121 YDKFKQSTHSI YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180 



Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

1 1 1 . n 1 1 1 mi 1 1 1 1 1 , 1 1 1 1 1 1 1 m iiiiiiiii i iiiiiii him 

Db 181 YLSNRLIAPSNSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSKDNTLQVDINGFT 240 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 3 00 

h 1 1 1 1 1 1 I! I II 1 1 1 1 1 1 1 1 1 1! I h I < 1 1 I l! I h I'll 1 I ' I 

Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR ALDTNYCFS 287 

Qy 301 STEKNCCVRQLY I DFRKDLGWKW I HEPKG YHANFCLGPCP Y 1WSLDTQYS KVLALYNQHN 360 

II I I I Ihh II I Ihl I II I h Ih h II I II I I III I h I ' h II h hh II I II 
Db 288 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 347 

Qy 361 PGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 4 03 

IIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIMIIIIIIM 

Db 34 8 PGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 390 

RESULT 13 
AAR46227 

ID AAR46227 standard; Protein; 390 AA. 
XX 

AC AAR46227; 
XX 

DT 25-MAR-2003 (updated) 

DT 09-JUL-1994 (first entry) 

XX 

DE Human pre-TGF-beta - 1 . 
XX 

KW TGF-beta-1; TGF-beta-2; transforming growth factor beta-1; 

KW transforming growth factor beta-3; recombinant; wound healing; 

KW vulnerary. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT Peptide 279. .390 

FT /label= Mat_peptide 

FT Cleavage-site 279 

FT /note= "TGF-beta-1 release site" 

FT Modif ied-site 82 . . 84 

FT /label= N-glycosylation_site 

FT Modif ied-site 136.. 138 

FT /label= N-glycosylation__site 

FT Modif ied-site 176.. 178 

FT /label= N-glycosylat ion_site 

XX 

PN US5284763-A. 
XX 

PD 08-FEB-1994. 
XX 

PF 04-MAR-1992; 92US - 084 58 93 . 
XX 

PR 22-MAR-1985; 85US- 0715142 . 

PR 13-MAR-1987; 87US- 0025423 . 

PR 04-AUG-1989; 89US- 038 992 9 . 

PR 04-MAR-1992; 92US- 084 58 93 . 



XX 

PA (GETH ) GENENTECH INC . 
XX 

PI Derynk RMA, Goeddel DV; 
XX 

DR WPI; 1994-056343/07. 

DR N-PSDB; AAQ56923 . 
XX 

PT Nucleic acid sequences encoding transforming growth factor-beta - 

PT diagnostic probes, and for use in therapeutics 

XX 

PS Disclosure; Fig lb; 25pp; English. 
XX 

CC cDNA sequences were determined for human pre-TGF-beta- 1 (AAQ56923) , 

CC pig TGF-beta-3 (AAQ56925) and human TGF-beta-3 (AAQ56926) , and the 

CC corresponding amino acid sequences were determined (AAR46227 -29 , 

CC respectively) . A genomic fragment corresponding to a human TGF- 

CC beta-1 exon (AAQ56924) was also isolated and its amino acid sequence 

CC determined (AAR46230) . The sequences have been used in the 

CC construction of vectors for the expression of recombinant TGF- 

CC beta . 

CC (Updated on 25-MAR-2003 to correct PF field.) 
XX 

SQ Sequence 3 90 AA; 

Query Match 89.4%; Score 1912.5; DB 15; Length 390; 

Best Local Similarity 90.3%; Pred. No. 2e-163; 

Matches 364; Conservative 10; Mismatches 16; Indels 13; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I III 1 1 M 1 1 1 1 M I :M II M I 1 1 1 1 1 L 1 1 1 1, 1 1 1 II II Ml I < 1 1 1 1 : 1 1 1 II 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

1 1 ! ' I M I II 1 1 1 1 1 1 M 1 1 , M II M I ; I I 1 1 1 Mil 1 M III 1 1 1 1 1 1 : hi 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEREPEPEADYYAKEVTRVLMVETHNEI 120 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180 

1 1 1 1 1 : M 1 1 M I II M I II 1 1 1 1 M 1 1 1 M 1 1 M M I M II 1 1 1 1 1 1 1 1 M II 

Db 121 YDKFKQSTHSI YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

MM MM I M i M 1 1 1 M M M MM M I f 1 1 1 1 1 1 1 I IMIMI MINI 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEI EGFRLSAHCSCDSRDNTLQVDINGFT 240 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 3 00 

MIIMMIIMMIIMMMIMIIMM MUM MINIMI 

Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR ALDTNYCFS 287 

Qy 301 STEKNCC^QLYIDFRKDI^WKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 360 

II Ml M : I M I MM 1 1 III . II IM III I M 1 1 M III III M MM 1 1 M i M M M 

Db 28 8 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 34 7 

Qy 361 PGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 403 

1 1 1 1 M M 1 1 1 1 1 1 II M M II I M II II M II 1 1 1 1 II I M I 

Db 34 8 PGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 390 



RESULT 14 
AAR83054 

ID AAR83054 standard; Protein; 391 AA. 
XX 

AC AAR83 054; 
XX 

DT 25-JUN-1996 (first entry) 
XX 

DE Transforming growth factor-beta 1. 
XX 

KW macrophage inducible nitric oxide synthase; iNOS; constitutive NOS; 

KW interleukin-l-beta; transforming growth factor-beta; TGF-beta; ILl-beta; 

KW nitric oxide production; hypotension; inflammation; septic shock; 

KW treatment. 

XX 

OS Mammalian sp. 
XX 

FH Key Location/Qualifiers 
FT Protein 279.. 391 

FT /note= "represents the mature active TGF beta-1 mol . " 

XX 

PN W09526745-A1 . 
XX 

PD 12-OCT-1995. 
XX 

PF 05-APR-1994; 94WO-US03705 . 
XX 

PR 05-APR-1994; 94WO-US03705 . 
XX 

PA (HARD ) HARVARD COLLEGE. 
XX 

PI Lee M, Perrella MA; 
XX 

DR WPI; 1995-358443/46. 
DR N-PSDB; AAT05876. 
XX 

PT Treatment of hypotension, esp . in septic shock - by administering 
PT transforming growth factor-beta e.g. to inhibit inducible nitric 
PT oxide synthase gene transcription 
XX 

PS Disclosure; Fig 17; 52pp; English. 
XX 

CC Transforming growth factor-beta 1 (TGF-beta 1) has been found to inhibit 

CC inducible nitric oxide synthase (iNOS) gene transcription, esp. in 

CC interleukin-l-beta (ILl-beta) stimulated rat smooth muscle cells, and at 

CC a dose which does not inhibit consitutive NOS. TGF-beta 1 or 2 

CC (AAR83055) or their active fragments (esp. derived from the 

CC carboxy-terminal 112 amino acids) , can be used in the treatment of 

CC hypotension, such as that associated with severe inflammation or septic 

CC shock. 

XX 

SQ Sequence 391 AA; 



Query Match 89. 3% ; Score 1911; DB 16 

Best Local Similarity 90.3%; Pred . No. 2.7e-163 
Matches 365; Conservative 10; Mismatches 15 



Length 3 91; 

Indels 14; Gaps 2; 



Qy 


l 


MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 

1 MIIIMIIIIIMIMIIII 1 II il 1 II II 1 'Mill 

MP P SGLRLLPLLLPLLWLLVLTPGP PAAGLSTCKT I DMELVKRKR I EAI RGQI LSKLRLA 


60 


Db 


1 


60 


Qy 


61 


SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 

IMIIhllllllllllMIIIIIIIIIIIII lllllllllllllllllllllh hi 

S P P SQGE VP PG PLP E1AVLALYNSTRDRVAGESAEPE PE PEAD Y YAKE VTRVLM VETHNE I 


120 


Db 


61 


120 


Qy 


121 


YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLL-RLKLKVEQHVELYQKYSNDSW 

Mill : Ihll 1 1 1 1 1 II 1 1 1 1 1 M l 1 . i 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 M 1 1 M 
YDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRRLKLKVEQHVELYQKYSNNSW 


179 


Db 


121 


180 


Qy 


180 


RYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHSSSDSKDNTLHVEINGF 

1 M 1 M 1 1 1 1 M 1 1 Ml 1 1 II 1 1 M 1 II M lllllllll 1 MM Mill 

RYLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGF 


239 


Db 


181 


240 


Qy 


240 


NSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCF 
• 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 MINI 1 1 1 1 1 1 1 1 


299 


Db 


241 


• 1 II II 1 1 II II II II II 1 II II II II II II 1 1 1 II 1 II 1 1 II 1 1 
TTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR ALDTNYCF 


287 


Qy 


300 


SSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQH 

1 M 1 1 III 1 II II M 1 1 M Ml 1 M 1 1 IM 1 1 1 II 1 1 1 II 1 M M II 1 1 M 1 1 1 1 II 1 M 

SSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQH 


359 


Db 


288 


347 


Qy 


360 


NPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 4 03 

IM IMM III MM IMIMIIMI MIIMI II Ml MMM 

NPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 3 91 




Db 


348 





RESULT 15 




AAW78785 




ID 


AAW78785 standard; Protein; 390 AA. 


XX 






AC 


AAW78785; 




XX 






DT 


25-MAR-2003 


(updated) 


DT 


21-DEC-1998 


(first entry) 


XX 






DE 


Human pre-transf orming growth factor-beta 1. 


XX 






KW 


Transforming growth factor-beta 1; TGF-beta 1; human 


XX 






OS 


Homo sapiens. 




XX 






FH 


Key 


Locat ion/Qua 1 if iers 


FT 


Domain 


8 . .23 


FT 




/note= "hydrophobic domain" 


FT 


Protein 


279. .390 


FT 




/label= Mat protein 


FT 


Modif ied-site 


82. .84 


FT 




/note= "Asn is N-glycosylated" 


FT 


Modi f ied-site 


136. . 138 


FT 




/note= "Asn is N-glycosylated" 


FT 


Modif ied-site 


176. . 178 


FT 




/note= "Asn is N-glycosylated" 


FT 


Cleavage-site 


277. .278 



FT /note= "cleavage site for relase of TGF-beta 1" 
XX 

PN US5801231-A. 
XX 

PD 01-SEP-1998. 
XX 

PF 30-MAY-1995; 95US- 0454468 . 
XX 

PR 13-MAR-1987; 87US- 0025423 . 

PR 22-MAR-1985; 85US-0715142 . 

PR 04-AUG-1989; 8 9US- 038 992 9 . 

PR 04-MAR-1992; 92US-0845893 . 

PR 05-NOV-1993; 93US - 014 73 64 . 

PR 30-MAY-1995; 95US- 0454468 . 
XX 

PA (GETH ) GENENTECH INC . 
XX 

PI Derynck RMA, Goeddel DV; 
XX 

DR WPI; 1998-494840/42. 

DR N-PSDB; AAV52933. 
XX 

PT DNA encoding transforming growth factor-beta precursor sequence - 

PT useful for analysis to perform manipulations to increase yield of 

PT recombinant production of the protein 
XX 

PS Example 3; Fig IB 1-3; 26pp; English. 
XX 

CC This is the amino acid sequence of human transforming growth 

CC factor-beta 1 precursor (preTGF-beta 1) . It was deduced from 

CC a preTGF-beta 1 cDNA sequence (see AAV52933) . The invention relates 

CC to the recombinant production of TGF-beta. Biologically active 

CC TGF-beta is defined as being capable of inducing EGF-potent iated 

CC anchorage independent growth of target cell lines and/or growth 

CC inhibition of neoplastic cell lines. Nucleic acids encoding 

CC TGF-beta have been isolated and cloned into vectors which are 

CC replicated in bacteria and expressed in eukaryotic cells. TGF-beta 

CC recovered from transformed cells is used in known therapeutic 

CC applications. 

CC (Updated on 25-MAR-2003 to correct PF field.) 
XX 

SQ Sequence 3 90 AA; 

Query Match 89.3%; Score 1910.5; DB 19; Length 390; 
Best Local Similarity 90.1%; Pred . No. 3e-163; 

Matches 363; Conservative 10; Mismatches 17; Indels 13; Gaps 1; 

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

i M I ' I I , I IN I'l I I I I ! I I I II lllllllllll llllllllllllllllllll 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPGPPAPGLSTCKTI DMEQVKRKR I EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

1 1 1 1 1 1 ^ 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 ri n n i 1 1 1 1 1 1 1 1: hi 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 120 



Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180 

Mill : I hi I II I Ml III Ml II I II I II I II I II MM Ml Ml Mill hill 



Db 



121 YDKFKQSTHS I YMFFNTS ELREAVPE PVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180 



Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 



181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHSSSDSKDNTLHVEINGFN 240 

MMIIIIMMIMimiMMlMhl II MM! II I I h 1 1 1 1 I = 1 1 1 1 

181 YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 240 
241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 3 00 

:| I! II II I I M i 1 1 1 1 : M I I ; : I I 'I I MINI hill 1 I 

241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR ALDTNYCFS 287 

3 01 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPY1WSLDTQYSKVLALYNQHN 360 

IIIIIIIMIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

2 88 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 3 47 

3 61 PGASAAPCCVPQALEPLP1VYYVGRKPKVEQLSNMI VRSCKCS 4 03 

I M I M 1 1 M 1 1 1 M II M II 1 1 1 1 1 1 1 1 M 1 1 1 1 1 II M II I 

348 PGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNM I VRSCKCS 3 90 



Search completed: October 28, 2003, 09:06:48 
Job time : 34.6197 sees 

GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence : 



October 28, 2003, 09:09:54 ; Search time 24.4507 Seconds 

(without alignments) 
2760.110 Million cell updates/sec 

US-10-017-372E-33 
2139 

1 MAPSGLRLLPLLLPLLWLLV GRKPKVEQLSNM I VRSCKCS 403 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched : 



629382 seqs, 167460630 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



629382 



Database 



Published_Applications_AA: * 

1 : /cgn2_6/ptodata/2/pubpaa/US07_PUBCOMB.pep: 

2 : /cgn2_6/ptodata/2/pubpaa/PCT_NEW_PUB . pep : * 

3 : /cgn2_6/ptodata/2/pubpaa/US06_NEW_PUB.pep 

4 : /cgn2_6/ptodata/2/pubpaa/US06_PUBCOMB .pep 

5 : /cgn2_6/ptodata/2/pubpaa/US07_NEW_PUB.pep 

6 : / cgn2_6 /p toda t a / 2 /pubpaa / PCTUS_PUBCOMB . pep : 



7 : /cgn2_6/ptodata/2/pubpaa/US08_NEW_PUB . pep : * 

8 : /cgn2_6/ptodata/2/pubpaa/US08_PUBCOMB.pep: * 

9 : /cgn2_6/ptodata/2/pubpaa/US09A_PUBCOMB . pep : * 

10 : /cgn2_6/ptodata/2/pubpaa/US09B_PUBCOMB.pep-. * 

11 : /cgn2_6/ptodata/2/pubpaa/US09C_PUBCOMB . pep : * 

12 : /cgn2_6/ptodata/2/pubpaa/US09_NEW_PUB.pep: * 

13 : /cgn2_6/ptodata/2/pubpaa/US10A_PUBCOMB.pep: * 

14 : /cgn2_6/ptodata/2/pubpaa/US10B_PUBCOMB.pep: * 

15 : /cgn2_6/ptodata/2/pubpaa/US10C_PUBCOMB.pep: * 

16 : /cgn2_6/ptodata/2/pubpaa/US10_NEW_PUB.pep: * 

17 : /cgn2_6/ptodata/2/pubpaa/US60_NEW_PUB.pep: * 

18 : /cgn2_6/ptodata/2/pubpaa/US60_PUBCOMB.pep: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 
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RESULT 1 

US-09-214-592-26 

Sequence 26, Application US/09214592A 
Publication No. US200300272 18A1 
GENERAL INFORMATION: 
APPLICANT : Yamasaki , CMotoo 
APPLICANT : Shibata , CKenj i 
APPLI CANT : Sato , CYasuf umi 

TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 

TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY -REGULATING COMPOUNDS 
FILE REFERENCE: 11060 

CURRENT APPLICATION NUMBER: US/09/2 14 , 592A 
CURRENT FILING DATE: 1999-01-18 
NUMBER OF SEQ ID NOS : 34 
SOFTWARE : 
SEQ ID NO 26 
LENGTH: 390 
TYPE : PRT 
ORGANISM: porcine 
US-09-214-592-26 

Query Match 95.0%; Score 2032.5; DB 11; Length 390; 

Best Local Similarity 95.8%; Pred. No. 1.4e-181; 

Matches 386; Conservative 1; Mismatches 3; Indels 13; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

I IIIIIIIIIIMIIIIIIIIIIIMMIIIIIIIIIIIIIIIIIIIIIMIIIIMM 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

llllllllllllllllllllllllllllllllllllllllllllllllllllhllllll 
Db 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMLESGNQI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

IIIMIIIIMIIIIIIIIIIIIIIIIIMIIIIIIIIIMIIIIIIIMIIIIIMIII 

Db 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

1 1 II I II 1 1 II 1 1 II M II 1 1 1 1 1 1 1 1 1 II I II 1 1 M M 1 1 1 I M 1 1 1 1 1 1 1 1 1 1 1 M 

Db 181 YLSNRLLAPSDS PEWLS FDVTGWRQWLTRR EA I EGFRLSAHCS CDS KDNTLH VE I NGFN 240 



Qy 


241 


UD 


24 1 


Qy 


301 


Db 


288 


Qy 


361 


Db 


348 



1 1 II 1 1 1 1 1 1 II 1 1 1 M 1 1 1 1 L II 1 1 1 1 1 1 1 1 MINIMI 

:GDLATIHGMNRPFLLLMATPLERAQHLHSSRHRR ALDTNYCFS 287 



I II I I I I I I ' I I II I ! I II I I I I II I I I IM I I I I , I M I I I : I I I I I I I I ' I I 
ITEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 

'GAS AAPCCVPQALE PLP I VY Y VGRKPKVEQLSNM I VRS CKCS 4 03 

i;irn mi ii iii inii i mini ii ii iiiiiiii 



RESULT 2 

US-09-214-592-29 

Sequence 29, Application US/09214592A 
Publication No. US20030027218A1 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 



Yamasaki , CMotoo 
Shibata, CKenj i 
Sato, CYasuf umi 

TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 

TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY -REGULATING COMPOUNDS 
FILE REFERENCE: 11060 

CURRENT APPLICATION NUMBER : US/09/2 14 , 592A 
CURRENT FILING DATE: 1999-01-18 
NUMBER OF SEQ ID NOS : 3 4 
SOFTWARE : 
SEQ ID NO 29 
LENGTH: 3 90 
TYPE: PRT 
ORGANISM: ovine 
US-09-214-592-29 

Query Match 90.3%; Score 1931.5; DB 11; Length 390; 

Best Local Similarity 90.6%; Pred. No. 4e-172; 

Matches 365; Conservative 10; Mismatches 15; Indels 13; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 6 0 

I IIIIIMIIIIIIMIhllllll IMIIIIIIIIIIMII 1 1 MM 1 1 1 1 1 1 1 1 

Db 1 MPPSGLRLLPLLLPLLWLLMLTPGRPVAGLSTCKTI DMELVKRKGI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

M M I III I II 1 1 1 1 M M I M IM Ml I II I I MMMMMMMMMM MM 

Db 61 SPPSQGDVPPGPLPEAILALYNSTRDRVAGESAETEPEPEADYYAKEVTRVLMVEYGNKI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Ml I = MMI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 h = 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I M h 1 1 1 

Db 121 YDKMKSSSHS I YMFFNTSELREAVPEPVLLSRADVRLLRLKLKVEQHVELYQKYSNNSWR 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

I II 1 1 M I M I M I Ml II I M M 1 1 M I II MIIIIMI I Mill 1 1 hlllM 

Db 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTHREEI EGFRLSAHCSCDSKDNTLQVDINGFS 24 0 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 3 00 

M II M I II I M I II IM I Ml IM MM 'I I M M 1 1 MINIMI 



Db 



241 



SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRR 



•ALDTNYCFS 287 



Qy 301 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 360 

MIMMMIMIIIMIIIIIIIIIIIIMMIMIIIIIIIIIIIMIIIIMIMII 

Db 288 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 347 

Qy 361 PGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 403 

IIIIIIIIIMIIIIIIIIIIIIIMIMIIIIIIIMIIIII 

Db 348 PGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 390 



RESULT 3 
US-10-087-268-2 

Sequence 2, Application US/10087268 
Publication No. US20030119010A1 
GENERAL INFORMATION: 
APPLICANT: Jonsonn, Julie Ruth 
APPLICANT: Powell, Elizabeth Ellen 

TITLE OF INVENTION: Polypeptides and polynucleotides linked to a disease or 
condition 

FILE REFERENCE: Fibrosis 

CURRENT APPLICATION NUMBER : US/10/ 087,268 
CURRENT FILING DATE: 2 002-03-01 
NUMBER OF SEQ ID NOS : 6 
SOFTWARE: Patent In version 3.1 
SEQ ID NO 2 
LENGTH: 3 90 
TYPE: PRT 
ORGANISM: Human 
US-10-087-268-2 

Query Match 90.2%; Score 1928.5; DB 15; Length 390; 

Best Local Similarity 90.8%; Pred. No. 7.6e-172; 

Matches 366; Conservative 10; Mismatches 14; Indels 13; Gaps 1; 

MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKR I EAI RGQI LSKLRLA 6 0 

I IMIMIIIIIMIIIIIIIIIIIIIMIIIIIIIMIIIIIMIIIIIIMIMIII 

MP P SGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKT I DMELVKRKR I EAI RGQ I LS KLRLA 6 0 

SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

IIIMhIIIIIIIIIIIIIIIIIIIIMIII lllllllllllllllllllllh hi 
SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 

YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

INN : Ihll M II M M II 1 1 1 1 1 M I M I M 1 1 1 M I M M II M M II h M I 

YDKFKQSTHSI YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 18 0 
YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 ^ I I III I MM I Ihllll hllll 

YLSNRLLAPS DS P EWLS FDVTG VVRQWLS RGGEI EGFRLS AHCS CDSRDNTLQVD I NGFT 24 0 

SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 300 

M I I I I I I I I | M I I I I I I I I I I I I I I I I I I 1 I 1 I I ! I ' , I I I I I 

TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR ALDTNYCFS 287 

STEKNCCWQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 360 

IMIMMMIMIIIIMIIIMIIIIIMIMIIMMIIIIIIIIIIIIIIIIIIII 



Qy 


i 


Db 


i 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 



Db 288 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 347 



Qy 361 PGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNM I VRSCKCS 4 03 

I M I M M I M I II M 1 1 M I M I M 1 1 M M M M M M I M 

Db 348 PGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNM I VRSCKCS 390 

RESULT 4 
US-10-087-268-5 

Sequence 5, Application US/10087268 
Publication No. US20030119010A1 
GENERAL INFORMATION : 
APPLICANT: Jonsonn, Julie Ruth 
APPLICANT: Powell, Elizabeth Ellen 

TITLE OF INVENTION: Polypeptides and polynucleotides linked to a disease or 
condition 

FILE REFERENCE: Fibrosis 

CURRENT APPLICATION NUMBER : US/10/087 , 268 
CURRENT FILING DATE: 2002-03-01 
NUMBER OF SEQ ID NOS : 6 
SOFTWARE : Patent In version 3.1 
SEQ ID NO 5 
LENGTH: 3 90 
TYPE : PRT 
ORGANISM: Human 
US-10-087-268-5 

Query Match 8 9.8%; Score 1921.5; DB 15; Length 3 90; 

Best Local Similarity 90.6%; Pred . No. 3.4e-171; 

Matches 365; Conservative 10; Mismatches 15; Indels 13; Gaps 1; 

MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

i iiiiiiiiiiiiiiiiiiiiii ! i ii i mi 1 1 1 1 1 1 1 ii 1 1, 1 1 1: 1 1 1 1 ii 1 1 1 m 

MP PSGLRLLPLLLPLLWLLVTiTPGPPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

M IM M I M I M I II II I MIM I MM III II II II MM I M II M I M IM hi 
SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 

YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180 
Mill : Ihll ' I I , I 

YDKFKQSTHSI YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 18 0 

YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHSSSDSKDNTLHVE I NGFN 24 0 

1 1 1 1 : 1 1 1 , 1 1 , 1 1 1 : 1 1 1 ; ii 1 1 1 1 1 h i MiiiiiM i ihllll hiiii 

YLSNRLLAPSDS PEWLSFDVTGWRQWLSRGGE I EGFRLSAHCSCDS RDNTLQVD I NGFT 24 0 



Qy 


l 


Db 


l 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


288 


Qy 


361 



1 1 II 1 1 Ml r 1 1 1 1 1 1 1 'II , II II 1 1 ,: 1 1 1 1 1: HIM 

X3RRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR ALDTN' 

ITEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYl 

MIIMMIIIIIIIIIIIIIMIIIIMIIIIIIIIIIIIIIIIIIIIIIMII 

TEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYl 
'GASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNM I VRSCKCS 4 03 

MIIMIMIIIIIIIIIIIMMIIIIIIIIIIIIIIIIII 



Db 34 8 PGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 3 90 



RESULT 5 
US-10-276-947-1 

Sequence 1, Application US/10276947 
Publication No. US20030176315A1 
GENERAL INFORMATION: 
APPLICANT: GLAXO GROUP LIMITED 
TITLE OF INVENTION: MEDICAL PRODUCTS 
FILE REFERENCE: PG3 94 9 

CURRENT APPLICATION NUMBER: US/ 1 0/276 , 94 7 
CURRENT FILING DATE: 2 002-11-21 
PRIOR APPLICATION NUMBER: GB 0012991.6 
PRIOR FILING DATE: 2000-05-26 
PRIOR APPLICATION NUMBER: GB 0100286.4 
PRIOR FILING DATE: 2001-01-05 
NUMBER OF SEQ ID NOS : 1 
SOFTWARE: Patent In Ver. 2.0 
SEQ ID NO 1 
LENGTH: 3 90 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-276-947-1 

Query Match 89.7%; Score 1918.5; DB 12; Length 390; 

Best Local Similarity 90.6%; Pred . No. 6.5e-171; 

Matches 365; Conservative 10; Mismatches 15; Indels 13; Gaps 1; 

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

I Ml I I I I I I I > I I I I I I I I I I I i I I II I ! 1 I I: I I I , I I I I I ' I I I I I !, ! 
Db 1 MPPSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

II 1 1 MM Ml M 1 1 IM II II M 1 1 M 1 1 M I IM II II I II IM 1 1 1 1 II IM hi 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Mill : Ihll I M 1 II I ,1 I I! I I I I I M I I I | II I, ! I I , I I I I ' I I I I II 
Db 121 YDKFKQSTHSI YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

i m: 1 1 1 1 1 1 1 1 ii i h i ii i 'i 1 1 1 1 h i iiiiMMi i imiiii hill, 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 240 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 3 00 

MMMMlMMI MMIMMMMMIM MMII MINIMI 

Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR ALDTNYCFS 287 

Qy 301 STEK^CCWQLYIDFRKDIX^WKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVTALYNQHN 3 60 

1 1 1 1 1 1 1 1 1 1 MM I II Ml I M I MM II M 1 1 1 M II I 1 1 1 IM 1 1 M 1 1 Ml MM 

Db 28 8 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 347 

Qy 361 PGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 4 03 

I IM 1 1 1 M 1 1 MM I M Ml MM I M M I II 1 1 MM II I M I 

Db 348 PGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 390 



RESULT 6 

US-09-214-592-33 

Sequence 33, Application US/09214592A 
Publication No. US20030027218A1 
GENERAL INFORMATION : 



APPLICANT 
APPLICANT 
APPLICANT 



Yamasaki , CMotoo 
Shibata , CKenj i 
Sato, CYasuf umi 

TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 

TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY -REGULATING COMPOUNDS 
FILE REFERENCE: 11060 

CURRENT APPLICATION NUMBER: US/09/214 , 592A 
CURRENT FILING DATE: 1999-01-18 
NUMBER OF SEQ ID NOS : 34 
SOFTWARE : 
SEQ ID NO 33 
LENGTH: 3 90 
TYPE : PRT 
ORGANISM: simian 
US-09-214-592-33 

Query Match 89.6%; Score 1916.5; DB 11; Length 390; 

Best Local Similarity 90.3%; Pred . No. le-170; 

Matches 364; Conservative 10; Mismatches 16; Indels 13; Gaps 1; 

MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAI RGQILSKLRLA 6 0 

I IIIIIIMIIIIIIIIIIIII 1 1 1 1 1 1 II I U I "j 1 1 1 1 llllllllllll 

M P PSGLRLL PLLLPLLWLLVLT PS RPAAGLSTCKT I DMELVKRKR I ET I RGQILSKLRLA 60 
SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

I I M I h 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I II 1 1 II II 1 1 1 1 1 1 II 1 1 II h hi 

SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 

YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Mill : 11 = 11 IIIIIIIIIIIMIIIIIMIIIIIIIIIIIIIIIIIIIIIhlll 
YDKFKQSTHSI YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180 

YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

IMMMIIhllllllllllllMllhl lllllllll I lllllll hllll 

YLSNRLLAPSNSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSKDNTLQVDINGFT 24 0 
SGRRGDLATI HGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 3 0 0 

; 1 1 M M I II 1 1 1 1 1 1 1 1 II M M II 1 1 1 1 1 M M II MMMMI 

TGRRGDLATI HGMNRPFLLLMATPLERAQHLQSSRHRR ALDTNYCFS 2 8 7 

STEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDTQYSKVLALYNQHN 360 

IIIIIIIIIIIIIIIIMIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 347 
PGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 4 03 

IIIIIIIIIIIIIIIIIIIMMIIMIIIIIMIIIIMIM 



Qy 


l 


Db 


l 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


288 


Qy 


361 


Db 


348 



RESULT 7 



US-09-214-592-17 

; Sequence 17, Application US/09214592A 
; Publication No. US20030027218A1 
; GENERAL INFORMATION: 

APPLICANT: Yamasaki, CMotoo 
; APPLICANT: Shiba ta , CKen j i 
; APPLICANT: Sato , CYasuf umi 

; TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 

; TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY -REGULATING COMPOUNDS 
; FILE REFERENCE: 11060 

; CURRENT APPLICATION NUMBER: US/09/2 14 , 592A 
; CURRENT FILING DATE: 1999-01-18 
; NUMBER OF SEQ ID NOS : 34 
; SOFTWARE : 
; SEQ ID NO 17 

LENGTH: 3 91 

TYPE : PRT 

ORGANISM: human 
US-09-214-592-17 



Query Match 8 9.3%; Score 1911; DB 11; Length 391; 

Best Local Similarity 90.3%; Pred. No. 3.3e-170; 

Matches 365; Conservative 10; Mismatches 15; Indels 14; Gaps 2; 



Qy 


1 


MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKT I DMELVKRKRI EAI RGQI LSKLRLA 

1 llllllilllllllllllllll IIIIIIIIIMIIIIIIIIIIIMIIIIMIIIM 

MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTI DMELVKRKR I EAI RGQI LS KLRLA 


60 


Db 


1 


60 


Qy 


61 


SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 
lllllhlllllllllllllllMIIIIIIII llllllilllllllllllllll: hi 


120 


Db 


61 


SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 


120 


Qy 


121 


YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLL-RLKLKVEQHVELYQKYSNDSW 

INN : Ihll llllllllllllllllllllllll lllllllllllllllllhll 
YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRRLKL.KVEQHVELYQKYSNNSW 


179 


Db 


121 


180 


Qy 


180 


RYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGF 


239 


Db 


181 


IIIIIIIIIIIIIMMIIIIMIIIIIhl lllllllll 1 Ihllll hllll 
RYLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGF 


240 


Qy 


240 


NSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCF 

:||||IIIMIIIIIIIMIIMIMIIIM MINI Ml!, Ill 


299 


Db 


241 


TTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR ALDTNYCF 


287 


Qy 


300 


SSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQH 

IIIMIIIIIIIIIIMIIMIIIIIIIIIIIIIIIMIIIIMMIMIIIIIMIMI 


359 


Db 


288 


SSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQH 


347 


Qy 


360 


NPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 4 03 

IMIIIIIIIIIIIIIMMMIIIMIIIIIIMIIIMIMI 

NPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 391 




Db 


348 





RESULT 8 

US-09-756-283A-23 

; Sequence 23, Application US/09756283A 



Patent No. US2 00201514 78A1 
GENERAL INFORMATION : 



APPLICANT 
APPLICANT 
APPLICANT 



Chernajovsky, Yuti 
Dreja, Hanna Stina 
Adams, Gillian 
TITLE OF INVENTION: Latent Fusion Protein 
FILE REFERENCE: 0623.1000000 

CURRENT APPLICATION NUMBER : US/ 09/756 , 2 8 3A 
CURRENT FILING DATE: 2001-01-09 
NUMBER OF SEQ ID NOS : 100 
SOFTWARE: Patent In version 3.0 
SEQ ID NO 23 
LENGTH: 3 90 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-756-283A-23 

Query Match 89.2%; Score 1907.5; DB 10; Length 390; 

Best Local Similarity 90.1%; Pred. No. 7e-170; 

Matches 363; Conservative 10; Mismatches 17; Indels 13; Gaps 1; 

MAPSGLRLLPLLLPLLWLLVTiTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 6 0 

I 1 1 1 Mi M I ! 1 1 1 1 II 1 1 1 1 1 1 M II MM 1 1 1 1 1 Ml 1 1 1 1 1 1 1 1 1 II I MM M I II 

MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTI DMELVKRKR I EAI RGQILSKLRLu 

SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQ 
IMM MMIIIIIIII III IM I llllllllllllllllllllll: :: 

sppsqgevppgplpeavlalynstrdrvagesaepepepeadyyakevtrvlmvethhe; 
ydkfkgtphslymlfntselreavpepvllsraelrllrlklkveqhvelyqkysndsw] 

Mill : ii ii ii i ii ii ii i ri 'i 1 1; i , i 1 m ii 1 1 1! ii ii mi iii h 'i 

YDKFKQSTHSTYMFFNISELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSW] 
YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFI 

I M M l : M 1 1 , 1 1 III I III 1 1 1 Ml lllllllll I Ml MINI 



M M I M 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M M M MUM lllllllll 

TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR ALDTNYCFS 287 



I M I MM 1 1 1! 1 1 1 II 1 1 M M M MMMI II M IMM III 1 1 1 1 Ml I II I M 1 1 
:teknccvrqlyidfrkdlgwkwihepkgyhanfclgpcpyiwsldtqyskvlalynqhn 

'gasaapccvpqale plp i vyyvgrkpkveqlsnm i vrsckcs 4 03 

II Ml MMMIIMMMMMMMIMMMIMM! 



Qy 


l 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


288 


Qy 


361 


Db 


348 



120 



RESULT 9 

US-09-214-592-28 

; Sequence 28, Application US/09214592A 

; Publication No. US20030027218A1 

; GENERAL INFORMATION: 

; APPLICANT: Yamasaki , CMotoo 

; APPLICANT: Shibata , CKen j i 



; APPLICANT: Sato , CYasuf umi 

; TITLE OF INVENTION : PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 

TITLE OF INVENTION : OF SCREENING TGF- ACTIVITY -REGULATING COMPOUNDS 
FILE REFERENCE: 11060 

CURRENT APPLICATION NUMBER: US/09/214 , 592A 
CURRENT FILING DATE: 1999-01-18 
NUMBER OF SEQ ID NOS : 34 
SOFTWARE: 
SEQ ID NO 28 
LENGTH: 3 90 
TYPE : PRT 
ORGANISM: canine 
US-09-214-592-28 

Query Match 88.6%; Score 1895.5; DB 11; Length 390; 

Best Local Similarity 89.6%; Pred. No. 9.3e-169; 

Matches 361; Conservative 12; Mismatches 17; Indels 13; Gaps 1; 

MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

I IMIIIIIIIMII ' , I : 1 ' 1 1 1 1 1 . ! 1 1 , 1 1 1 1 1 1 1 1 1 1 1 ! I ! : 

MPPSGLRLLPLLLPLLRLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLS 6 0 
SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

lllllhlll IMI 1 1 III 1 1 1 M 1 1 M 1 1 1 1 1 i 1 1 III 1 1 M 1 1 1 1 II 1 1 ! : hi 

SPPSQGEVPPVPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVENTNKI 120 
YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

hi I • h h I M 1 1 1 1 1 ' 1 1, 1 1 1 1 1 1 1 1 ; I i , 1 1 M 1 1 1 ' 1 1, 1 1 1 1 1 

YEKVKKSPHS I YMLFNTSELREAVPEPVLLSRAELRLLRLKLKAEQHVELYQKYSNDSWR 18 0 

YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240 

I I I I I M I, I h I I I ll I III I I I I I i h :|IMIIII I lllllll hlllh 

YLSNRLLAPSDTPEWLSFDVTGWRQWLSHGGEVEGFRLSAHCSCDSKDNTLQVDINGFS 24 0 

SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 3 00 

I I I I M II I I I I 1 I I I I Ml I I I II I I I I I I I I , II III, Mill 

SSRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRQRR ALDTNYCFS 2 87 

STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 3 60 

MIMI MM MM MM MMI Ml M Ml M II MM MMMM III Mill MMI 

STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 347 
PGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 4 03 

M I II I MM I II I II II I Ml M I M I II M I II II MM, 

PGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNM I VRSCKCS 3 90 



Qy 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


288 


Qy 


361 


Db 


348 



RESULT 10 
US-09-214-592-20 

; Sequence 20, Application US/09214592A 

; Publication No. US200300272 18A1 

; GENERAL INFORMATION: 

; APPLICANT: Yamasaki , CMotoo 

; APPLICANT: Shibata , CKenj i 

; APPLICANT: Sato , CYasuf umi 



; TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 

; TITLE OF INVENTION : OF SCREENING TGF- ACTIVITY- REGULATING COMPOUNDS 
; FILE REFERENCE : 11060 

; CURRENT APPLICATION NUMBER : US/09/214 , 592A 
; CURRENT FILING DATE: 1999-01-18 
; NUMBER OF SEQ ID NOS : 34 
; SOFTWARE : 
; SEQ ID NO 20 

LENGTH: 3 90 

TYPE : PRT 

ORGANISM: murine 
US-09-214-592-20 

Query Match 85.9%; Score 1837.5; DB 11; Length 390; 

Best Local Similarity 85.9%; Pred. No. 2.5e-163; 

Matches 346; Conservative 15; Mismatches 29; Indels 13; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 6 0 

I iMiiiiiii ii 1 1: in : 1 1 : Mi i M 1 1 1 1 m i 1 :i h 1 1 1 1 1 1 1 1 ini 1 1 

Db 1 MPPSGLRLLPLLLPLPWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

M ! M M I M I II 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 MM M II 1 1 II I M 1 1 M I M II 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESADPEPEPEADYYAKEVTRVLMVDRNNAI 120 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

hi I Ihll IMMMIMMI ,111 III 1 1 1 1 1 : 1 I I I II I' I I ! 

Db 121 YEKTKDISHSI YMFFNTSDIREAVPEPPLLSRAELRLQRLKSSVEQHVELYQKYSNNSWR 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAI EGFRLSAHSSSDSKDNTLHVEI NGFN 240 

II 1 1 II hhllllMMIIMMII = : hill III I INN MM II 

Db 181 YLGNRLLT PTDTP EWLSFDVTG WRQWLNQGDG I QGFRFSAHCS CDS KDN KLH VE I NG I S 240 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 3 00 

Mill III I I I I I I I I I I I I I I I M I I I I t I I I I MINIMI 
Db 241 PKRRGDLGTIHDMNRPFLLLMATPLERAQHLHSSRHRR ALDTNYCFS 287 

Qy 301 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 288 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 347 

Qy 361 PGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 4 03 

1 1 1 1 h 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 i 1 1 

Db 348 PGASASPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 



RESULT 11 
US-09-214-592-23 

; Sequence 23, Application US/09214592A 

; Publication No. US200300272 18A1 

; GENERAL INFORMATION: 

; APPLICANT: Yamasaki , CMotoo 

; APPLICANT: Shibata , CKenj i 

; APPLICANT: Sa to , CYasuf umi 

; TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 



TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY -REGULATING COMPOUNDS 
FILE REFERENCE: 11060 

CURRENT APPLICATION NUMBER: US/09/214 , 592A 
CURRENT FILING DATE: 1999-01-18 
NUMBER OF SEQ ID NOS : 34 
SOFTWARE : 
SEQ ID NO 23 
LENGTH: 3 90 
TYPE: PRT 
ORGANISM: rat 
US-09-214-592-23 

Query Match 85.9%; Score 1837.5; DB 11; Length 390; 

Best Local Similarity 85.9%; Pred. No. 2.5e-163; 

Matches 346; Conservative 14; Mismatches 30; Indels 13; Gaps 1; 

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 6 0 

I 'I I I II I I , I I I I I I ; I I I I I I 'I II I! I . I I I I I II I I I I Ml I II I I I I I I I I 
Db 1 MPPSGLRLLPLLLPLPWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

I M M h 1 1 1 1 M 1 1 M M 1 1 11 M M II 1 1 1 :|l.( 1 1 M I Mi.1,11 Ml 1 1: I I 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESADPEPEPEADYYAKEVTRVLMVDRNNAI 120 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180 

Ml I 1 1 M I 1 1 1 M M 1 1 II II MMMMI I I 1 1 li 1 1 1 1 1 1 1 1 M 1 1 1 

Db 121 YDKTKDITHS I YMFFNTSDI REAVPEPPLLSRAELRLQRFKSTVEQHVELYQKYSNNSWR 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

II 1 1 1 1 MMI MMMMI II II II : = Mill III I Mill II Mill 

Db 181 YLGNRLLTPTDTPEWLSFDVTGWRQWLNQGDGIQGFRFSAHCSCDSKDNVLHVEINGIS 240 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 300 

Mill Ml I II 1 1 M 1 1 1 1 II I ! l II 1 1 1 1 1 M , MMMMI 

Db 241 PKRRGDLGTIHDMNRPFLLLMATPLERAQHLHSSRHRR ALDTNYCFS 287 

Qy 301 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 360 

II 1 1 M 1 1 1 1 1 1 Ml 1 1 1 1 1 h' I II I Ml 1 1 M M M I 'I I MM 1 1, 1 1 1 1 1 1 1 1 1 II 

Db 288 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 347 

Qy 361 PGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 4 03 

1 1 MM I M M Ml 1 1 M I M 1 1 II MM M 1 1 1 M 1 1 1 II ! I 

Db 348 PGASASPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 



RESULT 12 
US-09-214-592-25 

; Sequence 25, Application US/09214592A 
; Publication No. US20030027218A1 
; GENERAL INFORMATION: 

APPLICANT: Yamasaki , CMotoo 
; APPLICANT: Shiba ta , CKenj i 
; APPLICANT: Sato , CYasuf umi 

; TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 

; TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY -REGULATING COMPOUNDS 
; FILE REFERENCE: 11060 



CURRENT APPLICATION NUMBER: US/09/2 14 , 592A 
CURRENT FILING DATE: 1999-01-18 
NUMBER OF SEQ ID NOS : 34 
SOFTWARE : 
SEQ ID NO 25 
LENGTH: 315 
TYPE: PRT 
ORGANISM: bovine 
US-09-214-592-25 

Query Match 73.6%; Score 1574.5/ DB 11; Length 315; 

Best Local Similarity 89.9%; Pred . No. 7.4e-139; 

Matches 295; Conservative 9; Mismatches 11; Indels 13; Gaps 1; 
Qy 76 AVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQIYDKFKGTPHSLYMLF 135 

h I II 1 1 1 II 1 1 1 M 1 1 I I II 1 M II 1 1 1 1 1 1 M M II Ihllll I = Ihll I 

Db 1 AI LALYNSTRDRVAGESAETEPEPEADYYAKEVTRVLMVEYGNKI YDKMKSSSHS I YMFF 60 

Qy 136 NTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEW 195 

I I I I I I I I I I I I I I I I I h : I I I I I I I I I I II I I I I I I I I I : I I I I I I I I I I I I I I I I I I 

Db 61 NTSELREAVPEPVLLSRADVRLLRLKLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEW 12 0 

Qy 196 LSFDVTGVVRQWLTRREAI EGFRLSAHSSSDSKDNTLHVEINGFNSGRRGDLATI HGMNR 255 

1 1 r i m 1 1 1 1 1 1 : 1 1 1 minimi i mmi i 1 1 1 1 M 1 1 1 : 1 1 1 1 ; 1 1 1 1 1 

Db 121 LSFDVTGWRQWLTRREEIEGFRLSAHCSCDSKDNTLQVDINGFSSGRRGDLATIHGMNR 180 

Qy 2 56 PFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFSSTEKNCCVRQLYIDF 315 

IIIIIIMIIIIIIIIII Mill IIIIIIIIMI III lllllll III 

Db 181 PFLLLMATPLERAQHLHSSRHRR ALDTNYCFSSTEKNCCVRQLYIDF 227 

Qy 316 RKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALE 3 75 

I I M II 1 1 1 II II II I II I II I II II II I II II I M II 1 1 II I M II I II II I II I M 1 1 

Db 22 8 RKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVXALYNQHNPGASAAPCCVPQALE 287 

Qy 376 PLPIVYYVGRKPKVEQLSNMIVRSCKCS 403 

MIMMMMMMMMMMIMM 

Db 288 PLPIVYYVGRKPKVEQLSNMIVRSCKCS 315 



RESULT 13 
US-09-756-283A-20 

; Sequence 20, Application US/09756283A 

; Patent No. US20020151478A1 

; GENERAL INFORMATION: 

; APPLICANT: Cherna j ovsky , Yuti 

APPLICANT: Dreja, Hanna Stina 
; APPLICANT: Adams, Gillian 

; TITLE OF INVENTION: Latent Fusion Protein 
; FILE REFERENCE: 0623.1000000 

; CURRENT APPLICATION NUMBER: US/09/756 , 283A 

; CURRENT FILING DATE: 2001-01-09 

; NUMBER OF SEQ ID NOS : 100 

; SOFTWARE: Patentln version 3.0 

; SEQ ID NO 20 

LENGTH: 455 

TYPE: PRT 

ORGANISM: Artificial Sequence 



FEATURE : 

OTHER INFORMATION: LAP-mlFN construct 
US-09-756-283A-20 



Query Match 59.0%/ Score 1262.5; DB 10; Length 455; 

Best Local Similarity 90.5%; Pred. No. 2e-109; 

Matches 248; Conservative 10; Mismatches 15; Indels 1; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKR I EAI RGQ I LSKLRLA 60 

I llllllllllllllllllllll I; I II I < I I ! 1 1 II !! I II II I II ' 1 1 1 1 1 1 1 1 1 1 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

1 1 1 I M 1 1 1 1 1 1 1 1 1 Ml I II! I !, 1 1 1 1 1 I M 1 1 Ml I II 1 1 Ml ,'l 1 1 1 1: hi 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLL-RLKLKVEQHVELYQKYSNDSW 179 

Mill : Ihll i 1 1 r I J I I I ] I I I I j I I I I I f I I I I I I I I I I M I II I II II M II 
Db 121 YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRRLKLKVEQHVELYQKYSNNSW 18 0 

Qy 18 0 RYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGF 23 9 

II M I M M 1 1 II II II III MM I M I Ml lllllllll I IMMII hllll 

Db 181 RYLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGF 24 0 

Qy 24 0 NSGRRGDLATI HGMNRPFLLLMATPLERAQHLHS 273 

MM 1 1 1 M I II 1 1 II I M MM III III I I 

Db 241 TTGRRGDLATI HGMNRPFLLLMATPLERAQHLQS 274 



RESULT 14 
US-09-756-283A-22 

; Sequence 22, Application US/09756283A 
; Patent No. US20020151478A1 
; GENERAL INFORMATION: 

APPLICANT: Chernajovsky, Yuti 
; APPLICANT: Dreja, Hanna Stina 
; APPLICANT: Adams, Gillian 
; TITLE OF INVENTION : Latent Fusion Protein 
; FILE REFERENCE: 0623.1000000 
; CURRENT APPLICATION NUMBER: US/09/756 , 283A 
; CURRENT FILING DATE: 2001-01-09 
; NUMBER OF SEQ ID NOS : 100 

SOFTWARE: Patent In version 3.0 
; SEQ ID NO 22 
LENGTH: 44 7 
TYPE : PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: ml FN -LAP construct 
US-09-756-283A-22 

Query Match 53.2%; Score 1139; DB 10; Length 447; 

Best Local Similarity 87.0%; Pred. No. 7e-98; 

Matches 228; Conservative 10; Mismatches 18; Indels 6; Gaps 3; 
Qy 14 PL - LWLLVLTPGRPAAGLSTCKTI DMELVKRKR I EAI RGQI LSKLRLAS PPSQGDVPPGP 72 

II 1 1 I II IMI M M M MMMM 1 1 1 MMIM Ml IMM II I Mill M 



Db 190 PLGLW AGGGSAAALSTCKTIDMELVKRKRIEAI RGQILSKLRLASPPSQGEVPPGP 245 

Qy 73 LPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQIYDKFKGTPHSLY 132 

Ml Ml II! I MM MINI I M M I M M M I M I II I II I = M 1 1 M M ■■ MM 

Db 246 LPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEIYDKFKQSTHSIY 305 

Qy 133 MLFNTSELREAVPEPVXLSRAELRLL-RLKLKVEQHVELYQKYSNDSWRYLSNRLLAPSD 191 

I MMMMMMIMMMMMI M M M I M M 1 1 M I M M 1 1 M M M I M 1 1 

Db 306 MFFNTSELREAVPEPVLLSRAELRLLRRLKLKVEQHVELYQKYSNNSWRYLSNRLLAPSD 365 

Qy 192 S PEWLS FDVTG WRQWLTRREA I EGFRLSAHSS S DS KDNTLHVE I NGFNSGRRGDLATI H 251 

III I I I I I I I I I II II iM lllllllll I Ihllll hllll MIMIIMM 
Db 366 SPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFTTGRRGDLATIH 425 

Qy 252 GMNRPFLLLMATPLERAQHLHS 2 73 

I I M 1 1 1 1 1 1 1 1 1 1 1 1 1 M I I 

Db 426 GMNR PFLLLMATPLERAQHLQS 447 



RESULT 15 
US-09-214-592-32 

; Sequence 32, Application US/09214592A 

; Publication No. US20030027218A1 

; GENERAL INFORMATION: 

; APPLICANT: Yamasaki , CMotoo 

; APPLICANT: Shibata , CKenj i 

; APPLICANT: Sato , CYasuf umi 

; TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 

; TITLE OF INVENTION: OF SCREENING TGF- ACTI VI TY- REGULATING COMPOUNDS 
; FILE REFERENCE: 11060 

; CURRENT APPLICATION NUMBER: US/09/214 , 592A 
; CURRENT FILING DATE: 1999-01-18 
; NUMBER OF SEQ ID NOS : 34 
; SOFTWARE : 
; SEQ ID NO 32 

LENGTH: 373 

TYPE : PRT 

ORGANISM: chicken 
US-09-214-592-32 



Query Match 4 9.0%; Score 1049; DB 11; Length 373; 

Best Local Similarity 53.5%; Pred. No. 1.4e-89; 

Matches 207; Conservative 52; Mismatches 100; Indels 28; Gaps 8; 

Qy 30 LSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVA 89 

Mlh MM h I I! I hi I III I I I I I Ml : II |||: I II M I I - : 
Db 2 LSTCQRLDLEAAKKKRI EAVRGQI LSKLRLTAPPPASETPPRPLPDDVRALYNSTQELLK 61 

Qy 90 GES- VEPEPEPEADYYAKEVTRVLMVESGNQI YDKFKGTPHSLYMLFNTSELREAVPEPV 148 

' I h =1 = 111= h I = = = = II- Ml I I I 

Db 62 QRARLRPP PDGPDE YWAKELRR I PMETTWDGAMEHWQPQSHS I FFVFN VSRARRG -GRPT 120 



Qy 149 LLSRAELRLLRLKLK VEQHVELYQKYSNDSWRYLSNRLLAPSDS PEWLS FDVTGV 203 

M 11111 = 11 I II Mill I I Mill I = : I M II I I I 

Db 121 LLHRAELRMLRQKAAADSAGTEQRLELYQGYGNASWRYLHGRSVRATADDEWLSFDVTDA 180 



Qy 204 VRQWLTRREAI EGFRLSAHSSSD SKDNTLHVEI NGFNSGRRGDLATI HGMNR - - PFL 258 

I |||: I : hll I : = = 111 =111= ' I =1 l = = 

Db 181 VHQWLSGSELLGVFKLS VHCPCEMGPGHAEEMR I S I EGFEQ - QRGDMQS I AKKHRRVPYV 23 9 

Qy 259 LLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCF - - SSTEKNCCVRQLY I DFR 316 

I II I III 111 = 1 II 111 = 111 : I MINI llllll 

Db 24 0 LAMALPAERANELHSARRRR -DLDTDYCFGPGTDEKNCCVRPLYI DFR 28 6 

Qy 317 KDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEP 376 

III lllll III II III 1=1 III Ull lllh III! llll III III II III III m 
Db 287 KDLQWKWIHEPKGYMANFCMGPCPYIWSADTQYTKVLALYNQHNPGASAAPCCVPQTLDP 346 

Qy 377 LPI VYYVGRKPKVEQLSNMI VRSCKCS 403 

llhlllll = 1111111 = 11 = 1111 
Db 34 7 LPI I YYVGRNVRVEQLSNMWRACKCS 373 



Search completed: October 28, 2003, 09:28:02 
Job time : 25.4507 sees 

GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



Searched : 



October 28, 2003, 00:37:16 ; Search time 13.3169 Seconds 

(without alignments) 
2910.285 Million cell updates/sec 

US-10-017-372E-33 
2139 

1 MAPSGLRLLPLLLPLLWLLV GRKPKVEQLSNM I VRSCKCS 4 03 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 
283308 seqs, 96168682 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 10 0% 
Listing first 45 summaries 



Database 



PIR_76 : * 
1: pirl:* 

pir2 : * 
pir3 : * 

pir4 : * 



283308 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 



A27512 

transforming growth factor beta-1 precursor - pig 
N,-Alternate names: TGF-beta 

C; Species: Sus scrofa domestica (domestic pig) 

C;Date: 05-Jun-1988 #sequence_revision 05-Jun-1988 #text_change 16-Jul-1999 

C;Accession: A27512; A26356; 146657 

R;Derynck, R. ; Rhee, L . 

Nucleic Acids Res. 15, 3187, 1987 

A;Title: Sequence of the porcine transforming growth factor-beta precursor. 
A;Reference number: A27512; MUID : 87174844 ; PMID:3470708 
A; Access ion: A27512 
A;Molecule type: mRNA 
A/Residues : 1-390 <DER> 

R;Cheifetz, S.; Weatherbee, J. A.; Tsang, M.L.S.; Anderson, J.K.; Mole, J.E.; 
Lucas, R. ; Massague, J. 
Cell 48, 409-415, 1987 

A ; Title: The transforming growth factor-beta system, a complex pattern of cross- 
reactive ligands and receptors. 

A/Reference number: A90890; MUID: 87102890; PMID:2879635 
A; Accession : A263 56 
A;Molecule type: protein 
A/Residues: 279-322 <CHE> 

R;Kondaiah, P.; Van Obberghen-Schillmg, E. ; Ludwig, R.L.; Dhar, R. ; Sporn, 
M.B. ; Roberts, A . B . 

J. Biol. Chem. 263, 18313-18317, 1988 

A;Title: cDNA cloning of porcine transforming growth factor-beta 1 mRNAs . 
Evidence for alternate splicing and polyadenylation. 
A;Reference number: 146657; MUID : 89054010 ; PMID:2461367 
A; Access ion : 14 6657 

A;Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A;Residues: 1-390 <KON> 

A; Cross-references : GB:M23703; NID:g755044; PIDN:AAA64616 . 1 ; PID:g755045 
C;Genetics : 

A; Gene: TGFB ; TGF-beta- 1 
C;Superf amily : inhibin 
C; Keywords: growth factor 

Query Match 95.2%; Score 2 035.5; DB 2; Length 390; 

Best Local Similarity 96.0%; Pred. No. 1.2e-155; 

Matches 387; Conservative 0; Mismatches 3; Indels 13; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 



Db 



1 




Qy 



61 



SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 



Db 



61 



SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 



Qy 



121 



YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 



Db 



121 




Qy 



181 



YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 24 0 



Db 



181 




Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 3 00 

' 1 1 1 1 1 1 1 1 II I M 1 1 1 ! , I II I M . 1 1 1 M 1 1 1 1 , lllllllll 

Db 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRR ALDTNYCFS 287 

Qy 3 01 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 360 

ii mi 1 1 Mill ., ill 1 1 1 1 1. 1 '. . 1 1 1 ; i , i n: 1 1 . i 'i 1 1 1 ii 1 1 1 ill 

Db 288 STEKNCCWQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 347 

Qy 361 PGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 403 

I M II 1 1 1 II II I M 1 1 1 1 II M 1 1 1 M M II I II II II II II 

Db 348 PGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 390 



RESULT 2 
S01413 

transforming growth factor beta-1 precursor - chicken 
C;Species: Gallus gallus (chicken) 

C;Date: 30-Jun-1989 #sequence_revision 30-Jun-1989 #text_change 24-Nov-1999 
C; Access ion: SO 14 13 

R;Jakowlew, S.B.; Dillard, P.J.; Sporn, M.B.; Roberts, A . B . 
Nucleic Acids Res. 16, 8730, 1988 

A; Title: Nucleotide sequence of chicken transforming growth factor-beta 1 (TGF- 
beta 1) . 

A;Reference number: S01413; MUID : 88335639 ; PMID:3166520 
A; Access ion : SO 14 13 
A; Molecule type: DNA 
A;Residues: 1-391 <JAK> 

A; Cross-references: EMBL:X12373; NID:g63808; PI DN : CAA3 0933 . 1 ; PID:g63809 
C; Super family: inhibm 
C;Keywords: growth factor 

Query Match 93.7%; Score 2 004; DB 2; Length 391; 

Best Local Similarity 95.0%; Pred. No. 4e-153; 

Matches 384; Conservative 0; Mismatches 6; Indels 14; Gaps 2; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 6 0 

I Ml 1 1 1 M II M 1 1 M M II M M M M 1 1 M I M M II 1 1 M M I M II M I M I 

Db 1 MPPSGPGLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 6 0 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

I I I > I . ' I I ! I ! I I i I I I I I I I 1 I ' I . I I . I I I I I I I i I I ::![ I I I I I I I I I I i I ' ! II 

Db 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1! M I M 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWG 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEIN-GF 239 

1 1 1 M 1 1 1 M I M 1 1 ! M 1 1 M M I M I II 1 1 M 1 1 1 1 1 1 M I II 1 1 II II 1 1 1 1 II 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHCSCDSKDNTLHVEINAGF 240 

Qy 24 0 NSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCF 2 99 

II M M M Mill M II I II I MM Ml I M II M I II I Illlllll 

Db 241 NSGRRGDLAT I HGMNRP FLLLMAT PLERAQHLH S S RHRR ALDTNYCF 287 



Qy 



300 SSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQH 359 



MMIM IM MIMMII MMIIMI I MMMMIMM Mill IMMI IIIMM 

Db 288 SSTEKNCCWQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQH 347 

Qy 360 NPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 4 03 

MMMMMMMMMMMMMMM MMMMMMM 

Db 348 NPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKGS 3 91 



RESULT 3 
146463 

transforming growth factor beta-1 - sheep 

C; Species: Ovis oriental is aries, Ovis ammon aries (domestic sheep) 

C;Date: 19-Dec-1997 #sequence_revision 19-Dec-1997 #text__change 24-Nov-1999 

C;Accession: 146463; S45115 

R;Woodall, C.J.; McLaren, L.J.; Watt, N.J. 

Gene 150, 371-373, 1994 

A;Title: Sequence and chromosomal localisation of the gene encoding ovine latent 
transforming growth factor-beta 1. 

A /Reference number: 146463; MUID : 95121 932 ; PMID:7821809 
A; Accession : 14 64 63 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A;Residues: 1-390 <WOO> 

A; Cross-references: EMBL:X76916; NID:g496648; PIDN : CAA54242 . 1 ; PID:g496649 
A;Note: submitted to the EMBL Data Library, December 1993 
C; Superfamily : inhibin 

Query Match 9 0.3%; Score 1931.5; DB 2; Length 390; 

Best Local Similarity 90.6%; Pred . No. 2.6e-147 ; 

Matches 365; Conservative 10; Mismatches 15; Indels 13; Gaps 1; 

MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 6 0 

I 1 1 M ' M 1 1 1 1 1 M MIM Ml II MMMMMMIMM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

MPPSGLRLLPLLLPLLWLLMLTPGRPVAGLSTCKTI DMELVKRKGI EAI RGQI LSKLRLA 6 0 
SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKE VTRVLMVESGNQI 12 0 

Ml 1 1 1 M 1 1 1 1 II I Ml I IM M M M M M I M M M 1 1 1 M I M 1 1 M M MM 

SPPSQGDVPPGPLPEAILALYNSTRDRVAGESAETEPEPEADYYAKEVTRVLMVEYGNKI 12 0 
YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

III I = IM M II I M I III M M 1 1 1 MM 1 1 II M 1 1 1 IM 1 1 1 M I IM- Ml 

YDKMKSSSHS I YMFFNTSELREAVPEPVLLSRADVRLLRLKLKVEQHVELYQKYSNNSWR 18 0 
YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

III I M M 1 1 1 II I MMI II IM MM I II MINIMI I 1 1 1 1 1 1 1 Ml 

YLSNRLLAPSDSPEWLSFDVTGWRQWLTHREEIEGFRLSAHCSCDSKDNTLQVDINGFS 24 0 
SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 3 00 

I I M 1 1 M M 1 1 1 M II II 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 MIM I 

SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRR ALDTNYCFS 28 7 

STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 360 

1 1 i 1 1 1 1 1 1 1 1 1 , 1 1 1 1 II , 1 1 1 1 M . 1 1 1 1 1 1 1 II ,■ 1 1 1 i 1 1 ! 1 1 1 1 1 1 , M I ! 1 1 1 M 

STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 34 7 
PGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 4 03 

1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 r 1 1 



Qy 


i 


Db 


i 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


288 


Qy 


361 



Db 348 PGASAAPCCVPQALEPLP I VY Y VGRKPKVEQLSNMI VRSCKCS 390 



RESULT 4 
WFHU2 

transforming growth factor beta-1 precursor [validated] - human 
N,-Alternate names: growth- inhibitory factor; TGF type 2; TGF-beta 
C; Species: Homo sapiens (man) 

C;Date: 28-Feb-1986 #sequence_revision 19-Oct-1995 #text_change OS-Dec-2000 
C;Accession: A27513; A01395; A22290; 159664/ S53444 
R;Derynck, R.; Rhee, L. ; Chen, E.Y.; Van Tilburg, A. 
Nucleic Acids Res. 15, 3188-3189, 1987 

A; Title: Intron-exon structure of the human transforming growth factor-beta 
precursor gene. 

A;Reference number: A27513; MUID : 87174845 ; PMID:3470709 
A; Access ion : A27513 
A; Molecule type: DNA 
A/Residues : 1-390 <DER> 

A; Cross-references : GB:X05839; GB:Y00112; NID:g37097; PIDN : CAA29283 . 1 ; 
PID:gl212989 

R;Derynck, R.; Jarrett, J. A.; Chen, E.Y.; Eaton, D.H.; Bell, J.R.; Assoian, 
R.K.; Roberts, A.B.; Sporn, M.B.; Goeddel, D.v. 
Nature 316, 701-705, 1985 

A; Title: Human transforming growth factor-beta complementary DNA sequence and 
expression in normal and transformed cells. 
A;Reference number: A01395; MUID : 85296301 ; PMID: 3861940 
A /Accession : AO 13 95 
A; Molecule type: mRNA 

A; Residues: 1-9, 'P' ,11-24, ' P' , 26-159, 'R' , 160-3 9 0 <DE2> 

A; Cross-references: GB:X02812; GB:J05114; NID:g37092; PIDN : CAA2 6580 . 1 ; 
PID:g37093 

A;Note: the authors suggest that residues 8-23 could represent the hydrophobic 

core of an amino- terminal signal peptide 

R;Massague, J.; Like, B. 

J. Biol. Chem. 260, 2636-2645, 1985 

A;Title: Cellular receptors for type beta transforming growth factor. Ligand 

binding and affinity labeling in human and rodent cell lines. 

A/Reference number; A22290; MUID : 85131019 ; PMID: 2982829 

A;Accession: A22290 

A;Molecule type: protein 

A;Residues: 279-295, 'XX' , 298-301 <MAS> 

R/Urushizaki , Y. ; Niitsu, Y.; Terui, T. ; Koshida, Y. ; Mahara , K. ; Kohgo, Y. ; 
Urushizaki, I.; Takahashi , Y.; Ito, H. 
Tumor Res. 22, 41-55, 1987 

A;Title: Cloning and expression of the gene for human transforming growth 
factor-beta in Escherichia coli. 
A;Reference number: 159664 
A, -Access ion : 159664 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A;Residues: 279-390 <RES> 

A; Cross -references: GB:M38449; NID : g33 9557 ; PIDN :AAA3 6735 . 1 ; PID:g339558 
R;Stam, K. ; Stewart, A. A. ; Qu, G . Y . ; Iwata, K.K.; Fenyoe, D. ; Chait, B.T.; 
Marshak, D.R.; Haley, J.D. 
Biochem. J . 305, 87-92, 1995 

A ; Title: Physical and biological characterization of a growth- inhibitory 
activity purified from the neuroepithelioma cell line A673 . 



A;Reference number: S53444; MUID : 95126934 ; PMID:7826358 
A; Accession: S534 44 
A; Status: preliminary 
A; Molecule type: protein 
A;Residues: 279-297 <STA> 

C; Comment: The mature protein is the carboxyl -terminal segment of a precursor 
polypeptide; the active molecule is a dimer of identical polypeptide chains 
linked by an interchain disulfide bond. 
C;Genetics : 

A; Gene: GDB : TGFB1 ; TGFB 

A; Cross-references : GDB: 12072 9; OMIM: 19018 0 
A ; Map position: 19ql3 .2-19ql3 . 2 
C; Superfamily : inhibin 

C; Keywords: glycoprotein; growth factor; homodimer; mitogen; transformation 
F; 1-18/Domain : signal sequence #status predicted <SIG> 
F; 19 -278 /Domain : propeptide #status predicted <PRO> 
F;244-246/Region: cell attachment (R-G-D) motif 

F;279-390/Product : transforming growth factor beta-1 tfstatus experimental <MAT> 
F; 82 , 136 , 176/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 89.7%; Score 1918.5; DB 1; Length 390; 

Best Local Similarity 90.6%; Pred . No. 2.9e-146; 

Matches 365; Conservative 10; Mismatches 15; Indels 13; Gaps 1; 

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I lllllll I I I I ' I II I I! I I I I I I I I I I I I I I I I I I I I ' I I I I II I I II M I ! I I I 
Db 1 MPPSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

I II 1 1 h 1 1 1 1 1 1 1 1 ; 1 1 ' Ml 1 1 1 1 1 1 1 M I 1 1 1 1 i 1 1 1 h 1 1 ' I Ml II 1 1 h Ml 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 120 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180 

MMI : I M M MMMMMMMMIMMMMMMMMMMMMMMI 

Db 121 YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180 

Qy 181 YLSNRLLAPSDS PEWLSFDVTGWRQWLTRREAI EGFRLSAHSSSDSKDNTLHVE I NGFN 240 

M I M I MM i 1 1 1 1 1 1 II I II M 1 1 1 1 M MMMMI I MUM MMM 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 240 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 3 00 

Ml II I M 1 1 1 M I M I M Ml I II Ml II I MINI MMMMI 

Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR ALDTNYCFS 2 87 

Qy 301 STEKNCCWQLYIDFRKD1X5WKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ( I I I 
Db 288 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 347 

Qy 361 PGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 403 

I I I I I II I I 1 I I I I I I I I I I I I I I f I I I I I I I I I I i I I I I I I I 
Db 34 8 PGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 3 90 



RESULT 5 
A26960 

transforming growth factor beta-1 precursor - green monkey 
C; Species: Cercopithecus aethiops (green monkey, grivet) 



C/Date: 05-Oct-1988 #sequence_revision 05-Oct-1988 #text_change 24-Nov-1999 
C; Access ion: A26 960 

R;Sharples, K.; Plowman, G.D.; Rose, T.M. ; Twardzik, D.R.; Purchio, A . F . 
DNA 6, 239-244, 1987 

A;Title: Cloning and sequence analysis of simian transforming growth factor-beta 
CDNA . 

A;Reference number: A26960; MUID : 87246074 ; PMID:3474130 
A; Accession: A26 96 0 
A; Molecule type: mRNA 
A/Residues : 1-390 <SHA> 

A; Cross-references : GB:M16658; NID : gl76552 ; PIDN : AAA35369 . 1 ; PID:gl76553 
C; Superf amily : inhibin 
C; Keywords: growth factor 

F; 1 - 16 /Domain : signal sequence #status predicted <SIG> 

F; 17-390/ Product : transforming growth factor beta #status predicted <MAT> 

Query Match 89.6%; Score 1916.5; DB 2; Length 390; 

Best Local Similarity 90.3%; Pred. No. 4.2e-146; 

Matches 364; Conservative 10; Mismatches 16; Indels 13; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

I IIIMMIII II Mlllli II 1 1 1 1 1 1 1 1 1 M II I M II I , I 1 1 ■ 1 1 1 ' 1 1 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPSRPAAGLSTCKTI DMELVKRKRI ETI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

I I I I I h I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I | | | | I | | | I | | | | | | | : | : | 
Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Mill = Ih III II I IM I :l 1 1 IIIIIIIMIIII'I Mill II I Ihlll 

Db 121 YDKFKQSTHSI YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240 

MMMMMMMMMMMMMMM MMIMM I MMMI MMM 

Db 181 YLSNRLLAPSNSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSKDNTLQVDINGFT 240 

Qy 241 SGRRGDLATI HGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 300 

MMMMMMMMMMMMMMM! MMM MMMMI 

Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR ALDTNYCFS 287 

Qy 301 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 360 

1 1 II 1 1 1 1 1 1 M 1 1 1 1 1 MM MMMMI MMMMMMMMMMMMMMM 

Db 288 STEKNCCVRQLYIDFRKDLGWKWI HEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 347 

Qy 361 PGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 4 03 

! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 34 8 PGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 3 90 



RESULT 6 
JC4023 

transforming growth factor beta-1 - dog 
C; Species: Canis lupus familiaris (dog) 

C;Date: 13-Jun-1995 #sequence_revision 14-Jul-1995 #text_change 24-Nov-1999 
C;Accession: JC4023 

R;Manning, A.M.; Auchampach, J. A. ; Drong, R.F.; Slightom, J.L. 
Gene 155, 307-308, 1995 



A; Title: Cloning of a canine cDNA homologous to the human transforming growth 
factor-beta 1 -encoding gene. 

A /Reference number: JC4023; MUID : 95237630 ; PMID:7721110 
A /Accession : JC4 023 
A;Molecule type: mRNA 
A;Residues: 1-390 <MAN> 

A;CrosS-referenceS: GB:L34956; NID:g516071; PIDN : AAA51458 . 1 ; PID:g516072 

C;Comment: This factor plays a multifunctional role as a regulator of mammalian 

cell growth and as a modulator of immune responses. 

C;Genetics : 

A;Gene: tgf-betal 

C; Super family: mhibin 

C; Keywords: growth factor; transforming protein 

F;288-390/Product : transforming growth factor beta 1 #status predicted <MAT> 

Query Match 88.6%; Score 1895.5; DB 2; Length 390; 

Best Local Similarity 89.6%; Pred. No. 2e-144; 

Matches 361; Conservative 12; Mismatches 17; Indels 13; Gaps 1; 

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

I I M I II I I I , I I I I ! I , I I I I I I I ' I II M Ml II I I. I M I ll I I I I I I I I lh 
Db 1 MPPSGLRLLPLLLPLLRLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLS 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

I M : 1 1 : 1 1 1 Mill IIIIIIIIIIIMI M II 1 1 1 1 M 1 1 1 II 1 1 II I M hi 

Db 61 SPPSQGEVPPVPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVENTNKI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

hi I M I Ml 1 1 II II I II II I Ml M II Ml 1 1 1 Ml M I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 121 YEKVKKS PHS I YMLFNTSELREAVPEPVLLSRAELRLLRLKLKAEQHVELYQKYSNDSWR 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

M. II -MINIM I IMIMI 

Db 181 YLSNRLLAPSDTPEWLSFDVTGWRQWLSHGGEVEGFRLSAHCSCDSKDNTLQVDINGFS 24 0 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 3 00 

I 1 1 IM Ml 1 1 1 Ml 1 1 1 M M I Ml M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 241 SSRRGDLAT I HGMNRPFLLLMATPLERAQHLHS SRQRR ALDTNYCFS 287 

Qy 301 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 360 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 i 1 1 1 1 1 1 1 r i 

Db 288 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 347 

Qy 361 PGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI WSCKCS 4 03 

M 1 1 1 1 1 1 II II 1 1 II I II I MM II I Ml II 1 1 1 II I II 1 1 

Db 348 PGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 390 



RESULT 7 
WFMS2 

transforming growth factor beta-1 precursor - mouse 
N ; Alternate names: TGF type 2; TGF-beta 
C; Species: Mus musculus (house mouse) 

C;Date: 04-Dec-1986 #sequence_revis ion 04-Dec-1986 #text_change 24-Nov-1999 
C;Accession: A01396 

R;Derynck, R . ; Jarrett, J. A. ; Chen, E.Y.; Goeddel , D.V. 
J . Biol. Chem. 261, 4377-4379, 1986 



A; Title: The murine transforming growth factor-beta precursor. 
A;Reference number: A01396; MUID: 86168129 ; PMID:3007454 
A; Access ion: AO 13 96 
A /Molecule type: mRNA 
A;Residues: 1-390 <DER> 

A; Cross-references : GB:M13177; NID:g201952; PIDN : AAA40423 . 1 ; PID:g201953 

A; Note: the authors suggest that residues 8-23 could represent the hydrophobic 

core of an amino- terminal signal peptide 

C;Comment: The mature protein is the carboxyl -terminal segment of a precursor 
polypeptide; the active molecule is a dimer of identical polypeptide chains 
linked by an interchain disulfide bond. 
C; Super family: inhibin 

C;Keywords: glycoprotein; growth factor; growth regulation; homodimer; mitogen; 
transformation 

F; 1 -23/Domain : signal sequence tfstatus predicted <SIG> 
F;24-278/Domain: propeptide #status predicted <PRO> 
F ; 244-246/Region: cell attachment (R-G-D) motif 

F;279-390/Product : transforming growth factor beta-1 #status predicted <MAT> 
F;82 / 136, 176/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 85.9%; Score 1837.5; DB 1; Length 390; 

Best Local Similarity 85.9%; Pred. No. 9.2e-140; 

Matches 346; Conservative 15; Mismatches 29; Indels 13; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

I MINIMI, 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 ![ 1 1 1 1 i I II! ! ! , 1 1 1 j I III 

Db 1 MPPSGLRLLPLLLPLPWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

I I I I I h I I I I I I I II I I I I I II I I I I I I I I I : I I I I I I I I I I I I I I I I I I | | : II 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESADPEPEPEADYYAKEVTRVLMVDRNNAI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180 

hi I Ihll MINIMI MINIM Ml 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 

Db 121 YEKTKDI SHS I YMFFNTSDI REAVPEPPLLSRAELRLQRLKSSVEQHVELYQKYSNNSWR 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHSSSDSKDNTLHVEI NGFN 24 0 

I I MM MMIIIIIIIMIMMII : = hill III I Mill IMIIII 

Db 181 YLGNRLLTPTDTPEWLSFDVTGWRQWLNQGDGIQGFRFSAHCSCDSKDNKLHVEINGIS 240 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 300 

Mill III 1 1 1 ! I i 1 1 1 1 i 1 1 : 1 1 i I J 1 1 1 1 1 MIIIMM 

Db 241 PKRRGDLGTIHDMNRPFLLLMATPLERAQHLHSSRHRR ALDTNYCFS 2 87 

Qy 301 STEKNCCWQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 360 

II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MM Ml II IM I N 1 1 II 1 1 1 1 1 Nl 1 1 IN 1 1 1 III 1 1 1 II 

Db 28 8 STEKNCCWQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 347 

Qy 361 PGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 4 03 

1 1 Ml M II N I 1 1 III III III 1 1 1 M N II III IM II 1 1 

Db 348 PGASASPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 



RESULT 8 
S10219 

transforming growth factor beta-1 precursor - rat 
N; Alternate names: TGF type 2; TGF-beta 



C; Species: Rattus norvegicus (Norway rat) 

C;Date: 12-Feb-1993 #sequence__revision 12-Feb-1993 #text_change 24-Nov-1999 
C;Accession: S10219; PT0023; S02267 

R;Qian, S.W.; Kondaiah, P.; Roberts, A.B.; Sporn, M.B. 
Nucleic Acids Res. 18, 3059, 1990 

A;Title: cDNA cloning by PCR of rat transforming growth factor beta-1. 
A/Reference number: S10219; MUID : 90272425 ; PMID:2349108 
A; Access ion: SI 02 19 
A /Molecule type: mRNA 
A;Residues: 1-390 <QIA> 

A; Cross-references : EMBL:X52498; NID:g57341; PIDN : CAA3674 1 . 1 ; PID:g57342 
R;Okada, F. ; Yamaguchi, K. ; Ichihara, A.; Nakamura, T. 
J. Biochem. 106, 304-310, 1989 

A;Title: Purification and structural analysis of a latent form of transforming 
growth factor-beta from rat platelets. 

A;Reference number: PT0023; MUID : 90036779 ; PMID-.2478527 
A; Access ion: PT0023 
A;Molecule type: protein 

A; Residues: 30-32, 'X', 34-38, *Q' ,40-42, 'X* ,44 <OKA> 
R;Okada, F.; Yamaguchi, K. ; Ichihara, A.; Nakamura, T . 
FEBS Lett. 242, 240-244, 1989 

A; Title: One of two subunits of masking protein in latent TGF-beta is a part of 
pro-TGF-beta . 

A/Reference number: S02267; MUID : 89121078 ; PMID:2914605 
A /Accession: S02267 
A;Molecule type: protein 

A;Residues: 30-32, 'X 1 ,34-38, 'Q' ,40-42, 'X' ,44 <OK2> 
C; Superfamily : inhibin 

C; Keywords: glycoprotein; growth factor; integrin binding 
F; 1-29/Domain: signal sequence #status predicted <SIG> 
F;30-278/Domain: propeptide #status experimental <PRO> 
F;244-246/Region: cell attachment (R-G-D) motif 

F; 279-390/Product : transforming growth factor beta-1 #status predicted <MAT> 
F;82, 136, 176/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 85.9%; Score 1837.5; DB 2; Length 390; 

Best Local Similarity 85.9%; Pred. No. 9.2e-140; 

Matches 346; Conservative 14; Mismatches 30; Indels 13; Gaps 1; 

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I IIIIIIMIIIII I II I II I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 

Db 1 MP PSGLRLLPLLLPLPWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

M II I I : I II I II II II II II II II I I I II M : I I I I I I I I I I I II I I I M I h 
Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESADPEPEPEADYYAKEVTRVLMVDRNNAI 120 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

III I Ihll I I I I I I I I I II lllllllll I I llllllllllllhlll 
Db 121 YDKTKDI THS I YMFFNTSDI REAVPEPPLLSRAELRLQRFKSTVEQHVELYQKYSNNSWR 180 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240 

II I I II hhllllllllllllllll = = hill III I Mill IIMIII 

Db 181 YLGNRLLTPTDTPEWLSFDVTGWRQWLNQGDG I QGFRFSAHCSCDSKDNVLHVE I NGI S 240 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 300 

Mill III 1 1 III lllllllll Ml III I Mill lllllllll 



Db 241 PKRRGDLGTIHDMNRPFLLLMATPLERAQHLHSSRHRR 



ALDTNYCFS 28 7 



Qy 3 01 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 3 60 

M 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 II II I II 1 1 1 1 1 1 

Db 288 STEPCNCCVRQLYI DFRKDLGWKW I HEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 34 7 

Qy 361 PGASAAPCCVPQALEPLP I VY YVGRKPKVEQLSNM I VRS CKCS 4 03 

1 , 1 1 M M 1 1 1 1 1 1 1 1 M I 1 1 1 . 1 1 1 M . h 1 1 1 1 1 II I M 

Db 348 PGASASPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 



RESULT 9 
A40057 

transforming growth factor beta-1 precursor - bovine (fragment) 

N;Alternate names: beta-TGF; cartilage-inducing factor-A; EGF-dependent TGF or 

dEGF; MGF-b; milk growth factor b; TGF-type II 

C; Species: Bos primigenius taurus (cattle) 

C;Date: 28-Feb-1992 #sequence_revision 28-Feb-1992 #text_change 16-Jul-1999 
C;AccesSion: A40057; A42320; A05284; A24322; B61439 

R;Van Obberghen-Schilling, E.; Kondaiah, P.; Ludwig, R.L.; Sporn, M.B.; Baker, 
C.C. 

Mol . Endocrinol- 1, 693-698, 1987 

A;Title: Complementary deoxyribonucleic acid cloning of bovine transforming 
growth f actor-betal . 

A /Reference number: A40057; MUID : 91042552 ; PMID:3153459 
A; Access ion : A4 0057 
A /Molecule type: mRNA 
A;Residues: 1-315 <VAN> 

A; Cross-references : GB:M36271; NID:gl63747; PIDN: AAA30778 . 1 ; PID:gl63748 
R;Ogawa, Y. ; Schmidt, D.K.; Dasch, J.R.; Chang, R.J.; Glaser, C.B. 
J. Biol. Chem. 267, 2325-2328, 1992 

A;Title: Purification and characterization of transforming growth f actor-beta2 . 3 

and -betal.2 heterodimers from bovine bone. 

A;Reference number: A42320; MUID: 92129307 ; PMID: 1733936 

A; Accession: A4232 0 

A;Molecule type: protein 

A;Residues: 204 -209 , 'X ' , 211 -217 <OGA> 

R;Roberts, A.B.; Anzano, M.A.; Meyers, C.A.; Wideman, J. ; Blacher, R. ; Pan, 
Y.C.E.; Stein, S.; Lehrman, S.R.; Smith, J.M.; Lamb, L.C.; Sporn, M.B. 
Biochemistry 22, 5692-5698, 1983 

A;Title: Purification and properties of a type beta transforming growth factor 
from bovine kidney. 

A; Reference number: A05284; MUID : 84 104793 ; PMID:6607069 
A /Accession: AO 52 84 
A;Molecule type: protein 
A;Residues: 204-218 <ROB> 

R;Seyedin, S.M.; Thompson, A.Y.; Bentz, H. ; Rosen, D.M.; McPherson, J.M.; Conti, 
A.; Siegel, N.R.; Galluppi, G.R.; Piez, K . A . 
J. Biol. Chem. 261, 5693-5695, 1986 

A;Title: Cartilage-inducing factor-A. Apparent identity to transforming growth 
factor-beta . 

A;Reference number: A24322; MUID : 861 95954 ; PMID:3754555 
A; Access ion: A24 322 
A;Molecule type: protein 
A;ResidueS: 204-233 <SEY> 

R;Jin, Y. ; Cox , D.A.; Knecht , R . ; Raschdorf, F . ; Cerletti, N. 
J. Protein Chem. 10, 565-575, 1991 



A;Title: Separation, purification, and sequence identification of TGF-betal and 
TGF-beta2 from bovine milk. 

A; Reference number: A61439; MUID : 92189724 ; PMID:1799413 
A; Accession : B61439 
A;Molecule type: protein 

A;Residues: 204-209, 'X' ,211-217, 'XX' ,220-232 <JIN> 

C; Comment: This polypeptide is composed of two polypeptide chains cross-linked 
by disulfide bonds. It has been found in neoplastic and non-neoplast ic tissues. 
C; Comment: Type II TGF does not bind to the EGF receptor and lacks intrinsic 
mitogenic activity, but in soft agar, it reacts synergistically with either type 
I TFG or EGF, and induces cell proliferation. Cells grown in monolayer do not 
respond in a similar manner to these growth factors, but morphologically do 
acquire a transformed phenotype. 
C; Super family : inhibin 

C; Keywords: glycoprotein; growth factor; heterodimer 

F; 204 -3 15/ Product : transforming growth factor beta-1 #status experimental <MAT> 
F; 7, 61 , 101/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 73.6%; Score 1574.5; DB 2; Length 315; 

Best Local Similarity 89.9%; Pred . No. 9e-119; 

Matches 295; Conservative 9; Mismatches 11; Indels 13; Gaps 1; 

Qy 76 AVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQIYDKFKGTPHSLYMLF 135 

hlllMIIMMMM I IMMMIIIIMIIIMM 11:1111 I : 11=11 I 
Db 1 AILALYNSTRDRVAGESAETEPEPEADYYAKEVTRVLMVEYGNKI YDKMKSSSHSI YMFF 60 

Qy 136 NTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEW 195 

M II II II M II II II I h M 1 1 1 1 1 1 M 1 1 1 1 1 II 1 1 II h 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 61 NTSELREAVPEPVLLSRADVRLLRLKLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEW 120 

Qy 196 LSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFNSGRRGDLATIHGMNR 255 

MIIIMMIIIIMM MINIMI I II M II I h 1 1 1 h 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 121 LSFD VTGWRQWLTRREE I EGFRLSAHCSCDSKDNTLQVD I NGFSSGRRGDLATI HGMNR 18 0 

Qy 256 PFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFSSTEKNCCVRQLYIDF 315 

1 1 1 1 ; i ; i m : 1 1 1 mini miimmmiimmimimm 

Db 181 PFLLLMATPLERAQHLHSSRHRR ALDTNYCFSSTEKNCCVRQLYIDF 22 7 

Qy 316 RKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALE 3 75 

I I M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! I i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 

Db 228 RKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALE 287 

Qy 376 PLPIVYYVGRKPKVEQLSNMIVRSCKCS 4 03 

I I I r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 m 1 1 1 1 1 1 1 

Db 288 PLPIVYYVGRKPKVEQLSNMIVRSCKCS 315 



RESULT 10 
A41918 

transforming growth factor beta-4 precursor - chicken (fragment) 
N ; Alternate names: TGF-beta 4 
C;Species: Gallus gallus (chicken) 

C;Date: 31-Dec-1993 #sequence_revision 31-Dec-1993 #text_change 16-Jul-1999 

C;Accession: A41918; A34941; S03110 

R;Burt, D.W.; Jakowlew, S.B. 

Mol. Endocrinol. 6, 989-992, 1992 



A;Title: Correction: a new interpretation of a chicken transforming growth 
factor-beta 4 complementary DNA. 

A; Reference number: A41918; MUID : 92357039 ; PMID:1353860 
A; Accession: A41918 
A /Molecule type: mRNA 
A;Residues: 1-373 <BUR> 

A; Cross-references : GB:M31160; GB:X08012; GB:S41706; NID : gl262437 ; 
PIDN:AAB05 637 . 1; PID:gl2624 38 

A/Note: sequence extracted from NCBI backbone (NCBIN: 110186, NCBIP : 110187) 
A/Note: this report corrects and reinterprets the sequence from reference A34 941 
R;Jakowlew, S.B.; Dillard, P. J.; Sporn, M.B.; Roberts, A.B. 
Mol. Endocrinol. 2, 1186-1195, 1988 

A;Title: Complementary deoxyribonucleic acid cloning of a messenger ribonucleic 
acid encoding transforming growth factor beta-4 from chicken embryo 
chondrocytes . 

A;Reference number: A34941; MUID: 89112198 ; PMID:2464131 
A /Access ion: A34 941 
A /Molecule type: mRNA 

A/ Residues : ' MDPMSIGPKSCGGSPWRPPGTAPWSIGSRRATASSSCSTSSRVRAEVGGRAL ' , 122- 

209, 'D' ,211-373 <JAK> 

A; Cross-references : EMBL:X08012 

A/Note: this sequence has been corrected in A41918 
C ; Superf amily : inhibin 

C; Keywords: glycoprotein/ growth factor 

F/l/Domain: signal sequence (fragment) #status predicted <SIG> 
F;223-225/Region: cell attachment (R-G-D) motif 

F,-260-373/Product : transforming growth factor beta-4 #status predicted <MAT> 
F ; 54 , 10 9 , 153 /Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 49.0%/ Score 1049/ DB 2; Length 373/ 

Best Local Similarity 53.5%/ Pred. No. 1.8e-76; 

Matches 207/ Conservative 52/ Mismatches 100/ Indels 28/ Gaps 8; 

Qy 3 0 LSTCKTIDMELVKRKRI EAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVA 8 9 

III h MM I MINIMUM III II Ml : II I I h I I I I I I I - - "• 
Db 2 LSTCQRLDLEAAKKKRI EAVRGQI LSKLRLTAPPPASETPPRPLPDDVRALYNSTQELLK 61 

Qy 90 GES- VEPEPEPEADYYAKEVTRVLMVESGNQI YDKFKGTPHSLYMLFNTSELREAVPEPV 148 

: : | |: :|:|||: |: | = : = := ||:: :|| | | | 
Db 62 QRARLRPPPDGPDEYWAKELRRI PMETTWDGAMEHWQPQSHSIFFVFNVSRARRG-GRPT 120 

Qy 14 9 LLSRAELRLLRLKLK VEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGV 2 03 

II llllhll I II :|||| I I Mill I : ■ MINIM 

Db 121 LLHRAELRMLRQKAAADSAGTEQRLELYQGYGNASWRYLHGRSVRATADDEWLSFDVTDA 18 0 

Qy 204 VRQWLTRREAI EGFRLSAHSSSD SKDNTLHVEI NGFNSGRRGDLATI HGMNR - - PFL 258 

MM: I : |: II I = = : I II I ! ! I M h = 

Db 181 VHQWLSGSELLGVFKLSVHCPCEMGPGHAEEMRISIEGFEQ-QRGDMOSIAKKHRRVPYV 2^9 

Qy 2 59 LLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCF - - SSTEKNCCVRQLYI DFR 316 

I II I Ml II I M II llhlll : IMMM MUM 

Db 24 0 LAMALPAERANELHSARRRR DLDTDYCFGPGTDEKNCCVRPLYI DFR 286 

Qy 317 KDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEP 376 

III IMIIMIM I I I I ^ I I I 1 ' I I I I I h I I I I I M I I I I I I I I I I II I I I Ml 

Db 287 KDLQWKWIHEPKGYMANFCMGPCPYIWSADTQYTKVLALYNQHNPGASAAPCCVPQTLDP 34 6 



Qy 377 LPIVTYVGRKPKVEQLSNMI VRSCKCS 403 

I I I ' I I I I I I I I I I I I I h I I I I 
Db 34 7 LP 1 1 YYVGRNVRVEQLSNMWRACKCS 373 



RESULT 11 
A34939 

transforming growth factor beta-3 precursor - chicken 
C; Species: Gallus gallus (chicken) 

C;Date: 13-Jul-1990 #sequence_revis ion 13-Jul~1990 #text_change 16-Jul-1999 
C;Accession: A34939; S25850; S36125; S36124; 151181 

R;Jakowlew, S.B.; Dillard, P. J.; Kondaiah, P.; Sporn, M.B.; Roberts, A . B . 
Mol . Endocrinol. 2, 747-755, 1988 

A;Title: Complementary deoxyribonucleic acid cloning of a novel transforming 
growth factor-beta messenger ribonucleic acid from chick embryo chondrocytes. 
A;Reference number: A34939; MUID: 89096966 ; PMID:3211158 
A /Accession : A34 93 9 

A;Status: preliminary; not compared with conceptual translation 
A;Molecule type: mRNA 
A;Residues: 1-412 <JAK> 

A; Cross -references : GB:M31154; NID:g212758; PIDN: AAA4 9089 . 1 ; PID:g212759 
R;Burt, D.W.; Paton, I.R.; Dey, B.R. 
J. Mol. Endocrinol. 7, 175-183, 1991 

A;Title: Comparative analysis of human and chicken transforming growth factor- 
beta-2 and -beta-3 promoters. 

A/Reference number: S25850; MUID : 92134496 ; PMID: 1840616 
A ,-Access ion: S2 58 50 
A; Status : preliminary 
A /Molecule type: DNA 
A/Residues: 1-117 <BUR> 

A; Cross-references : EMBL:X58127; NID:g63815; PIDN : CAA4 1128 . 1 ; PID:g63816 
A /Access ion: S3 6 12 5 

A/Status : preliminary; nucleic acid sequence not shown; translation not shown 
A; Molecule type: DNA 
A/Residues : 119-172 <BU2> 

A; Cross-references : EMBL:X60055; NID:g396688; PIDN: CAA42653 . 1 ; PID:g396689 
A/Note: the nucleotide sequence was submitted to the EMBL Data Library, June 
1991 

A /Access ion : S3 6124 

A; Status: preliminary; nucleic acid sequence not shown; translation not shown 
A; Molecule type: DNA 

A;ResidueS: 173-322, ' ELPT ' ,327-412 <BU3> 
A; Cross -references : EMBL:X60091 

A;Note: the nucleotide sequence was submitted to the EMBL Data Library, June 
1991 

R;Jakowlew, S.B.; Lechleider, R.; Geiser, A.G.; Kim, S.J. / Santa -Coloma , T . A . ,- 
Cubert, J.; Sporn, M.B.; Roberts, A . B . 
Mol. Endocrinol. 6, 1285-1298, 1992 

A;Title: Identification and characterization of the chicken transforming growth 
factor-beta 3 promoter. 

A/Reference number: 151181; MUID : 93024487 ; PMID: 1406706 
A; Access ion : 151181 
A; Status : preliminary 
A; Molecule type: DNA 
A;Residues: 1-117 <JA2> 

A; Cross-references : GB:S46000; NID:g257l72; PIDN : AAB23575 . 1 ; PID:g257173 
C;Genetics : 



A;Introns: 216/1; 252/1; 309/2; 360/3 

A; Note: list of introns may be incomplete 

C; Superfamily : inhibin 

C; Keywords: glycoprotein; growth factor; growth regulation; homodimer; mitogen 
F; 1-22/Domain : signal sequence #status predicted <SIG> 
F;23-300/Domam: propeptide #status predicted <PRO> 

F; 301-412/ Product : transforming growth factor beta-3 #status predicted <MAT> 
F; 74 , 142/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 4 0.8%; Score 873.5; DB 2; Length 412; 

Best Local Similarity 46.6%; Pred. No. 2.5e-62; 

Matches 198; Conservative 56; Mismatches 114; Indels 57; Gaps 14; 

Qy 15 LLWLLVLTPGRPAAGLSTCKT1DMELVKRKRIEAIRGQILSKLRLASPPSQGDVPPGPLP 74 

h I :h : 11 = 1 MIH = h I M I I I I I I I I I I I I I III I I :| 

Db 9 LVTjLSLLSFATVSLALSSCTTLDLEHI KKKRVEAI RGQI LSKLRLTSPPE - - SVGPAHVP 66 

Qy 75 EAVLALYNSTRDRVAGESVEPEPE PEADYYAKEVTRVLMVE SGNQIYDKF 124 

'lllllllh = I U I I l-llllh = I- h = 

Db 67 YQILALYNSTRELL- -EEMEEEKEESCSQENTESEYYAKEIHKFDMIQGLPEHNELGI CP 124 

Qy 125 KGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKVEQHVELYQKYSND 177 

II ::: II I I I III hlh : II Hhl I 

Db 125 KGVTSNVFR - FNVS SAEKNSTNLFRAEFRVLRVPNPSSKRSEQRI ELFQI LRPDEHI 180 

Qy 178 -SWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAH SSSDSKD 22 9 

MM I : III II I Ml I I hi I Ih I Ml : I 

Db 181 AKQRYLSGRNVQTRGSPEWLSFDVTDTVREWLLHRESNLGLEISIHCPCHTFQPNGDILE 240 

Qy 23 0 N TLHVE I NGFN SG RRGDLATI HGMNRPFLLLMATPLERAQH- -LHSSRHRR 278 

I I := I M MM : =: I MM I M I I M 

Db 241 NLHEVLEIKFKGIDSEDDYGRGDLGRLKKQKDLHNPHLILMMLPPHRLESPTLGGQRKKR 300 

Qy 279 ALDTNDYKDDDDKALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGP 338 

MM MM • I M M 1 1 M HUM II HIM Ml II MM M 

Db 3 01 ALDTNYCFRNLEENCCVRPLYIDFRQDLGWKWVHEPKGYFANFCSGP 34 7 

Qy 33 9 CP Y I WSLDTQYSKVLALYNQHNPGASAAPCCVPQALE PLP I VY Y VGRKPKVEQLSNM I VR 3 98 

MM I M M M Ml II 1 1 1 : 1 ! 1 1 1 1 MM MIMM IMMMIhh 

Db 348 CPYLRSADTTHSTVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPKVEQLSNMWK 4 07 

Qy 399 SCKCS 4 03 

Mill 

Db 4 08 SCKCS 412 



RESULT 12 
B61036 

transtorming growth factor beta-5 precursor - African clawed frog 
C; Species: Xenopus laevis (African clawed frog) 

C;Date: 31-Dec-1993 #sequence_revision 03-Feb-1994 #text_change 16-Jul-1999 
C;Accession: A34929; B61036 

R;Kondaiah, P.; Sands, M.J.; Smith, J.M. ; Fields, A.; Roberts, A.B.; Sporn, 
M.B. ; Melton, D.A. 

J. Biol. Chem. 265, 1089-1093, 1990 

A;Title: Identification of a novel transforming growth factor-beta (TGF-beta5) 
mRNA in Xenopus laevis. 



A/Reference number: A34929; MUID : 901 10090 ; PMID: 2295601 
A; Accession : A34 92 9 
A; Molecule type: mRNA 
A; Residues; 1-382 <KON> 

A; Cross-references: GB:J05180; NID:g214821; PIDN: AAA49968 . 1 ; PID:g214822 

R; Roberts, A.B.; Rosa , F . ; Roche, N.S.; Coligan, J.E.; Garfield, M. ; Rebbert , 

M.L.; Kondaiah, P.; Danielpour, D. ; Kehrl , J.H.; Wahl , S.M.; Dawid, I.B.; Sporn, 

M.B. 

Growth Factors 2, 135-147, 1990 

A;Title: Isolation and characterization of TGF-beta2 and TGF-beta5 from medium 
conditioned by Xenopus XTC cells. 

A;Reference number: A61036; MUID : 90253806 ; PMID: 2340184 

A /Access ion: B61036 

A; Molecule type: protein 

A /Residues : 271-276, 'X* ,278-284, 'XX' , 2 87-299 <ROB> 
C; Superf amily : inhibin 
C; Keywords : growth factor 

F ; 271 -382/Product : transforming growth factor beta-5 #status experimental <MAT> 

Query Match 40.7%; Score 870.5; DB 2; Length 382; 

Best Local Similarity 46.3%; Pred. No. 4e-62; 

Matches 190; Conservative 55; Mismatches 122; Indels 43; Gaps 11; 

Qy 9 LPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKR I EAI RGQI LS KLRLAS P PSQGDV 68 

: :| II I I I I I Mill MM h-IIMIIIIMIMM I I 

Db 1 MEVXWMLLVLLVLHLSSLAMSLSTCKAVT)MEEVRKRRIEAIRGQILSKLKLDKTPDV-DS 59 

Qy 69 PPGPLPEAVLALYNSTRDRVAGESVEPE PEPEADYYAKEVTRVLMVESGNQI YDK 12 3 

:| : Mill : : - | : I I I I h I I II - I 

Db 60 EKMTVPSEAI FLYNSTLEVI REKATREEEHVGHDQNI QDYYAKQVYR FES I TELED - 115 

Qy 124 FKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKL- -KVEQHVELYQKYSNDSW- - 179 

I II I Ml I II Mlh : = ::| =M= II : 

Db 116 HEFKFKFNASHVRENVGMNSLLHHAELRMYKKQTDKNMDQRMELFWKYQENGTTH 170 

Qy 180 -RYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAH SSSDSKDNTLHVE 235 

III :: : I II M I II I I M I I I I I I = Ml 
Db 171 SRYLESKYITPVTDDEWMSFDVTKTVNEWLKRAEENEQFGLQPACKCPTPQAKD ID 226 

Qy 236 INGFNSGRRGDLATIHGM- -NRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKAL 293 

III llllh: MM- : I II HUM 
Db 227 IEGF-PALRGDLASLSSKENTKPYLMITSMPAERIDTVTSSRKKR GV 272 

Qy 294 DTNYCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDTQYSKVL 353 

III : Mlh IIIMIIMIIIIIMIIM Ml IMIIIMMIIMI 

Db 273 GQEYCFGNNGPNCCVKPLYINFRKDLGWKWIHEPKGYEANYCLGNCPYIWSMDTQYSKVL 332 

Qy 354 ALYWQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 4 03 

M I M M II 1 1 M 1 1 1 1 IMIMMMM M MM II Mi M M 

Db 333 SLYNQNNPGASI SPCCVPDVLEPLPI I YYVGRTAKVEQLSNMWRSCNCS 3 82 



RESULT 13 
S01825 

transforming growth factor beta -3 precursor - pig 
C; Species: Sus scrofa domestica (domestic pig) 

C;Date: 30-Sep-1989 #sequence_revision 30-Sep-1989 #text_change 16-Jul-1999 



C; Access ion: SO 182 5 

R;Derynck, R. ; Lmdquist, P.B.; Lee, A.; Wen, D. ; Tamm, J.; Graycar, J.L.; Rhee, 
L . ; Mason, A.J.; Miller, D.A. ; Coffey, R.J. ; Moses, H.L.; Chen, E.Y. 
EMBO J. 7, 3737-3743, 1988 

A;Title: A new type of transforming growth factor-beta, TGF-beta3. 
A;Reference number: S01824; MUID: 89091120 ; PMID:3208746 
A, -Access ion: S01825 
A; Molecule type: mRNA 
A;Residues: 1-409 <DER> 

A/Cross-references: EMBL:X14150 ; NID:g2127; PIDN : CAA32363 . 1 ; PID:g2128 
C; Superf amily : inhibin 
C; Keywords: growth factor 

F ; 1 - 2 5 / Doma in : signal sequence #status predicted <SIG> 
F;26-297/Domain: propeptide #status predicted <PRO> 

F;298-409/Product : transforming growth factor beta-3 #status predicted <MAT> 

Query Match 39.1%; Score 836; DB 2; Length 409; 

Best Local Similarity 45.0%; Pred. No. 2.6e-59; 

Matches 190; Conservative 58; Mismatches 122; Indels 52; Gaps 14; 



Qy 15 LLWLLVLTPGRPAAGLSTCKT I DMEL VKRKR I EA I RGQ I LS KLRLAS P PSQGDVP PG PLP 74 

hi =1 : MM hi : III I I I I I I I I M I I Ml : I 

Db 7 LWLALLNFATVSLSMSTCTTLDFDHIKRKRVEA1RGQILSKLRLTSPPDPSML--ANIP 64 

Qy 75 EAVLALYNSTR DRVAGESVE - - PEPEPEADY YAKEVTRVLMV ESGNQI YDKFKG 126 

Db 65 TQVLDLYNSTRELLEEVHGERGDDCTQENTESEYYAKEI YKFDMI QGLEEHNDLAVCPKG 124 

Qy 127 TPHSLYMLFNTS ELREAVPEPVLLSRAELRLLRL KLKVEQHVELYQKYSND S 178 

:: || I : : | ||| hlh : II Mhl I 

Db 125 ITSKI FR-FNVSSVEK NETNLFRAEFRVLRMPNPSSKRSEQRI ELFQILQPDEHIAK 180 

Qy 179 WRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAH SSSDSKDN- 230 

Ih = I : MINIM lh|| Mh I Ml : I M 

Db 181 QRYIDGKNLPTRGAAEWLSFDVTDTVREWLLRRESNLGLEISIHCPCHTFQPNGD1LENI 240 

Qy 231 - -TLHVEINGFNS GRRGDLATI HGM- -NRPFLLLMATPLERAQH- -LHSSRHRRALD 281 

= =: I M MM : : I hll I M ' 1 = 1=1 
Db 241 QEVMEIKFKGVDSEDDPGRGDLGRLKKKKEHSPHL1LMM1 PPDRLDNPGLGAQRKKR 297 

Qy 282 TNDYKDDDDKALDTNYCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPY 341 

I ! 1 1 1 1 1 1 : Mill, ! 1 1 1 1 h 1 1 1 II i: 1 1 U I h I II Mill 

Db 298 ALDTNYCFRNLEENCCVRPLYIDFRQDLGWKWVHEPKGYYANFCSGPCPY 34 7 

Qy 342 I WSLDTQYSKVLALYNQHNPGASAAPCCVPQALE PLP I VYYVGRKPKVEQLSNM I VRSCK 401 

■III II Ml II Mhllllll MM IMIMI MMMIhhlll 

Db 348 LRSADTTHSSVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTAKVEQLSNMWKSCK 407 

Qy 4 02 CS 4 03 

II 

Db 408 CS 409 



RESULT 14 
A41397 

transforming growth factor beta-3 precursor - mouse 
C; Species: Mus musculus (house mouse) 



C;Date: 03-Apr-1992 #sequence_revision 03-Apr-1992 #text_change 16-Jul-1999 
C;Accession: A41397; A61039; A61225 

R;Miller, D.A. ; Lee, A.; Matsui, Y. ; Chen, E.Y.; Moses, H.L.; Derynck, R. 
Mol. Endocrinol. 3, 1926-1934, 1989 

A;Title: Complementary DNA cloning of the murine transforming growth factor- 

beta3 (TGFbeta3) precursor and the comparative expression of TGFbeta3 and 

TGFbetal messenger RNA in murine embryos and adult tissues. 

A; Reference number: A41397; MUID : 90190650 ; PMID:2628730 

A; Access ion : A413 97 

A;Molecule type: mRNA 

A;Residues: 1-410 <MIL> 

A; Cross-references : GB:M32745; NID:g201949; PIDN : AAA40422 . 1 ; PID:g201950 
R;Denhez, F. ; Lafyatis, R. ; Kondaiah, P.; Roberts, A.B.; Sporn, M.B. 
Growth Factors 3, 139-146, 1990 

A;Title: Cloning by polymerase chain reaction of a new mouse TGF-beta, mTGF- 
beta3 . 

A;Reference number: A61039; MUID : 91000714 ; PMID:2206556 
A/Accession : A61039 
A; Molecule type: mRNA 
A/Residues : 1-410 <DEN> 

R;Watrin, F . ; Scotto, L . ; Assoian, R.K.; Wolgemuth, D.J. 
Cell Growth Differ. 2, 77-83, 1991 

A/Title: Cell lineage specificity of expression of the murine transforming 

growth factor beta-3 and transforming growth factor beta-1 genes. 

A;Reference number: A61225; MUID: 91299576 ; PMID:2069871 

A; Access ion : A6122 5 

A;Status: translation not shown 

A;Molecule type: mRNA 

A;Residues: 285-410 <WAT> 

C; Super family: inhibin 

C; Keywords: glycoprotein; growth factor; growth regulation 
F; 1-21 /Domain : signal sequence tfstatus predicted <SIG> 
F;22-298/Domain: propeptide #status predicted <PR0> 
F;259-261/Region: cell attachment (R-G-D) motif 

F;299-410/Product : transforming growth factor beta-3 #status predicted <MAT> 
F; 72 , 133 , 140/Binding site: carbohydrate (Asn) (covalent) flstatus predicted 

Query Match 38.9%; Score 832.5; DB 2; Length 410; 

Best Local Similarity 44.7%; Pred. No. 4.9e-59; 

Matches 188; Conservative 58; Mismatches 126; Indels 49; Gaps 13; 

Qy 15 LLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLASPPSQGDVPPGPLP 74 

h I :| = I I I I hi : H hi I I ' I | || Ml | | III : 

Db 7 LWLALLNLATISLSLSTCTTLDFGHIKKKRVEAIRGQILSKLRLTSPPEPSVMT--HVP 64 

Qy 75 EAVLALYNSTR DRVAGESVE - - PEPEPEADYYAKEVTRVLMVE SGNQI YDKFKG 126 

lllllllll : : II I : hhlllh : h= h= II 

Db 65 YQVIjALYNSTRELLEEMHGEREEGCTQETSESEYYAKEI HKFDMI QGLAEHNELAVCPKG 124 

Qy 127 TPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKVEQHVELYQKYSND S 178 

-III:: I III I : I I : = II :|hl I 

Db 125 ITSKVFR- FNVSSVEK NGTNLFRAEFRVLRVPNPSSKRTEQRI ELFQILRPDEHIAK 180 

Qy 179 WRYLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAI EGFRLSAH SSSDSKDN- 230 

I h : I : MINIM IhM II M I : II I h 

Db 181 QRY I GGKNLPTRGTAEWLSFDVTDTVREVJLLRRESNLGLE I S I HCPCHTFQPNGD I LENV 240 



Qy 231 - -TLHVE I NGFN S GRRGDLAT I HGM NRPFLLLMATPLERAQHLHS SRHRRALDT 282 

: :: I :: |||| = ! Ill I h 

Db 241 HEVMEIKFKGVDNEDDHGRGDLGRLKKQKDHHNPHLILMMIPPHRLDSPGQGSQRK- - - - 296 

Qy 283 NDYKDDDDKALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYI 342 

MIIIIMI : hlllll lllllhlllllhlllMhllll llllh 
Db 297 KRALDTNYCFRNLEENCCVRPLY I DFRQDLGWKWVHEPKGYYANFCSGPCPYL 34 9 

Qy 343 WSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKC 4 02 

I II =1 II Ml II 1 1 hill III MM hlllll Mill II Ihh I III 

Db 350 RSADTTHSTVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPKVEQLSNMWKSCKC 4 09 

Qy 403 S 403 

I 

Db 410 S 410 



RESULT 15 
A55706 

transforming growth factor beta-3 precursor - rat 
C; Species: Rattus norvegicus (Norway rat) 

C;Date: 03-Mar-1995 #sequence_revis ion 03-Mar-1995 #text_change 16-Jul-1999 
C;Accession: A55706; B40699; S36042 

R;Wang, J.; Kuliszewski, M . ; Yee, W. ; Sedlackova, L. ; Xu, J.; Tseu, I.; Post, M . 
J. Biol. Chem. 270, 2722-2728, 1995 

A; Title: Cloning and expression of glucocort icoid-induced genes in fetal rat 

lung fibroblasts. Transforming growth f actor-beta-3 . 

A;Reference number: A55706; MUID : 95 155340 ; PMID:7852342 

A; Accession: A55706 

A; Molecule type: mRNA 

A;ResldueS: 1-410 <WAN> 

A; Cross-references : GB:U03491 

A;Note: it is uncertain whether Met-1 is the initiator 
R;McKinnon, R.D.; Piras, G.; Ida Jr., J. A.; Dubois -Dalcq, M. 
J. Cell Biol. 121, 1397-1407, 1993 

A;Title: A role for TGF-beta in oligodendrocyte differentiation. 
A; Reference number-. A40699; MUID : 93286190; PMID:8509457 
A;Accession : B40699 
A; Status : preliminary 
A /Molecule type: mRNA 
A;ResidueS: 157-211 <MCK> 

A; Cross-references: EMBL:X71903; NID:g311326; PIDN : CAA50722 . 1 ; PID:g311327 
C; Super family : inhibin 

Query Match 38.9%; Score 831.5; DB 2; Length 410; 

Best Local Similarity 44.7%; Pred. No. 5.9e-59; 

Matches 188; Conservative 58; Mismatches 126; Indels 49; Gaps 13; 

Qy 15 LLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLASPPSQGDVPPGPLP 74 

l : I • I : I I I I hi MMhlllMIMIIIII Ml : =| 

Db 7 LWLALLNLAT VSLSLSTCTTLDFGHI KKKRVEA I RGQ I LSKLRLTS P PE PS VMT - - HVP 64 

Qy 75 EAVLALYNSTR DRVAGESVE- - PEPEPEADYYAKEVTRVLMVE SGNQI YDKFKG 126 

MINIMI : : II I : hMMIh : h= h= II 

Db 65 YQVLALYNSTRELLEEMHGEREEGCTQETSESEYYAKE I HKFDM I QGLAEHNELAVCPKG 124 



Qy 127 TPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKVEQHVELYQKYSND S 178 



Db 



125 I TSKVFR - FNVSSVEK NGTNLFRAEFRVLRVPNPSSKRTEQRI ELFQI LRPDEHI AK 180 



Qy 179 WRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAH SSSDSKDN- 230 

Ih : I = llllllll Ihll MM I -II : I =| 

Db 181 QRYI GGKNLPTRGTAEWLSFDVTDTVREWLLRRESNLGLE I S I HCPCHTFQPNGD I LENV 24 0 

Qy 231 - - TLHVE I NGFNS GRRGDLATI HGM NRPFLLLMATPLERAQHLHSSRHRRALDT 282 

: =: I - Mil = : I hll I I h 
Db 241 HEVMEI KFKGVDNEDDHGRGDLGRLKKQKDHHNPHLI LMMI PPHRLDSPGQGGQRK 296 

Qy 283 NDYKDDDDKALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYI 342 

.-MINIM : Ml II II II Ml Ml II MM 1 1 Ml Mill I IMIM 

Db 2 97 KRALDTNYCFRNLEENCCVRPLYIDFRQDLGWKWVHEPKGYYANFCSGPCPYL 34 9 

Qy 343 WSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKC 4 02 

I II = I II III M 1 1 1 1 1 J 1 1 1 MM MIMM MMMMMMIMI 

Db 350 RSSDTTHSTVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPKVEQLSNMWKSCKC 4 09 

Qy 403 S 403 

I 

Db 410 S 410 



Search completed: October 28, 2003, 09:09:49 
Job time : 14.3169 sees 

GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



October 28, 2003, 00:08:41 ; Search time 8.07747 Seconds 

(without alignments) 
2346.251 Million cell updates/sec 



Title-. 

Perfect score: 
Sequence : 



US-10-017-372E-33 
2139 

1 MAPSGLRLLPLLL PLLWLLV GRKPKVEQLSNMI VRSCKCS 403 



Scoring table: 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 



127863 seqs, 47026705 residues 



Total number of hits satisfying chosen parameters: 



127863 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post -processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database 



SwissProt 41:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 



No . 


Score 


Match 


Length DB 


ID 


Description 


1 


2032 . 5 


95. 


0 


390 


1 


TGF1 PIG 


P07200 


sus scrofa 


2 


1931 . 5 


90. 


3 


390 


1 


TGF1_SHEEP 


P50414 


ovis aries 


3 


1918.5 


89. 


7 


390 


1 


TGF INHUMAN 


P01137 


homo sap i en 


4 


1916 . 5 


89. 


6 


390 


1 


TGF1_CERAE 


P09533 


cercopithec 


5 


1895.5 


88, 


, 6 


390 


1 


TGF1_CANFA 


P54831 


canis famil 


6 


1841 . 5 


86. 


1 


390 


1 


TGF1 HORSE 


019011 


equus cabal 


7 


1837 . 5 


85. 


.9 


390 


1 


TGF1_M0USE 


P04202 


mus musculu 


8 


1837 . 5 


85. 


, 9 


390 


1 


TGF1 RAT 


P17246 


rattus norv 


9 


1828 . 5 


85. 


. 5 


390 


1 


TGF1 CAVPO 


Q9zly6 


cavia porce 


10 


1574 . 5 


73 , 


,6 


315 


1 


TGF1 BOVIN 


P18341 


bos taurus 


11 


1049 


49. 


, 0 


373 


1 


TGF1_CHICK 


P09531 


gallus gall 


12 


870 . 5 


40. 


,7 


382 


1 


TGF1_XENLA 


P16176 


xenopus lae 


13 


854.5 


39. 


, 9 


412 


1 


TGF3_CHICK 


P16047 


gallus gall 


14 


836 


39. 


. 1 


409 


1 


TGF3__PIG 


P15203 


sus scrofa 


15 


832 . 5 


38. 


.9 


410 


1 


TGF3_MOUSE 


P17125 


mus musculu 


16 


831.5 


38. 


.9 


412 


1 


TGF3_RAT 


Q07258 


rattus norv 


17 


829.5 


38. 


. 8 


412 


1 


TGF3 HUMAN 


P10600 


homo sap i en 


18 


829 


38 , 


.8 


414 


1 


TGF2__M0USE 


P27090 


mus musculu 


19 


814 


38. 


.1 


414 


1 


TGF2 HUMAN 


P08112 


homo sapien 


20 


813.5 


38. 


. 0 


413 


1 


TGF2_XENLA 


P17247 


xenopus lae 


2 1 


8 13 


38. 


.0 


412 


1 


TGF2_CHICK 


P30371 


gallus gall 


22 


810 


37 


. 9 


435 


1 


TGF2 PIG 


P09858 


sus scrofa 


23 


810 


37. 


.9 


442 


1 


TGF2 RAT 


Q07257 


rattus norv 


24 


482 


22 


.5 


112 


1 


TGF2_B0VIN 


P21214 


bos taurus 


25 


327 . 5 


15. 


.3 


375 


1 


GDF8_MELGA 


042221 


meleagris g 


26 


325 . 5 


15. 


.2 


375 


1 


GDF8_CHICK 


042220 


gallus gall 


27 


315.5 


14 


.7 


375 


1 


GDF8_PIG 


018831 


sus scrofa 


28 


314.5 


14 . 


.7 


375 


1 


GDF8_PAPHA 


018828 


papio hamad 


29 


312.5 


14 


.6 


376 


1 


GDF8_RAT 


035312 


rattus norv 


J 0 


"3 11 C 

3 1 1 . b 


14 . 


. 6 


375 


1 


pnnn TIT TUA KT 

(jUr o HUMAN 


014793 


homo sapien 


31 


310 . 5 


14 . 


. 5 


376 


1 


GDF8_M0USE 


008689 


mus musculu 


32 


309.5 


14 


. 5 


375 


1 


GDF8_SHEEP 


018830 


ovis aries 


33 


304 


14 . 


.2 


405 


1 


GDFB_MOUSE 


Q9zlw4 


mus musculu 


34 


304 


14 


.2 


407 


1 


GDFB_HUMAN 


095390 


homo sapien 


35 


300.5 


14 


. 0 


375 


1 


GDF8_B0VIN 


018836 


bos taurus 


36 


284.5 


13. 


.3 


374 


1 


GDF8_BRARE 


042222 


brachydanio 


37 


282 . 5 


13 


.2 


345 


1 


GDFB_RAT 


Q9z217 


rattus norv 


38 


279 


13. 


. 0 


425 


1 


IHBA_SHEEP 


P43032 


ovis aries 


39 


276.5 


12 . 


. 9 


424 


1 


IHBA_PIG 


P03970 


sus scrofa 


40 


274 . 5 


12 


. 8 


424 


1 


IHBA_MOUSE 


Q04998 


mus musculu 


41 


274 . 5 


12 . 


.8 


424 


1 


IHBA_RAT 


P18331 


rattus norv 


42 


274 


12 . 


.8 


425 


1 


IHBA BOVIN 


P07995 


bos taurus 


43 


273 .5 


12 . 


.8 


426 


1 


IHBA HORSE 


P55102 


equus cabal 


44 


272 . 5 


12 . 


.7 


426 


1 


I HBA_HUMAN 


P08476 


homo sapien 


45 


269.5 


12, 


. 6 


424 


1 


IHBA_CHICK 


P27092 


gallus gall 



ALIGNMENTS 



RESULT 1 
TGF1_PIG 

ID TGF1_PIG STANDARD; PRT; 390 AA . 

AC P07200; P08832; 

DT 01-APR-1988 (Rel. 07, Created) 

DT 01-APR-1988 (Rel. 07, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1 . 

OS Sus scrofa (Pig) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Cetart iodactyla ; Suina; Suidae; Sus. 

OX NCBIjraxID=9823; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Ovary; 

RX MEDLINE=87174844 ; PubMed-3470708 ; 

RA Derynck R . , Rhee L . ; 

RT "Sequence of the porcine transforming growth factor-beta precursor."; 

RL Nucleic Acids Res. 15:3187-3187(1987). 

RN [2] 

RP SEQUENCE FROM N.A., AND VARIANT VAL-114. 

RC STRAIN=Miniature swine; 

RX MEDLINE=89054010; PubMed=24 6 13 67 ; 

RA Kondaiah P., van Obberghen-Schill ing E., Ludwig R.L., Dhar R. , 

RA Sporn M.B. , Roberts A.B.; 

RT "cDNA cloning of porcine transforming growth factor-beta 1 mRNAs . 

RT Evidence for alternate splicing and polyadenylation . " ; 

RL J . Biol. Chem. 263:18313-18317(1988). 

RN [3] 

RP SEQUENCE FROM N.A., AND VARIANT VAL-114. 

RX MEDLINE=88335639; PubMed=3 16652 0 ; 

RA Jakowlew S.B., Dillard P.J., Sporn M.B., Roberts A.B.; 

RT "Nucleotide sequence of chicken transforming growth factor-beta 1 

RT (TGF-beta 1) . " ; 

RL Nucleic Acids Res. 16:8730-8730(1988). 

RN [4] 

RP SHOWS THAT REF . 3 SEQUENCE IS FROM PIG. 

RA Jakowlew S . B . ; 

RL Unpublished observations (MAR-1996) . 

RN [5] 

RP SEQUENCE FROM N.A. , AND VARIANT VAL-114. 

RA Wimmers K. , Chomdej S. , Ponsuksili S., Schellander K. ; 

RT "Polymorphism in the porcine transforming growth factor beta 1 

RT gene . 11 ; 

RL Submitted (DEC-2001) to the EMBL/ GenBank/DDB J databases. 

RN [6] 

RP SEQUENCE OF 279-322. 

RX MEDLINE=871028 90; PubMed=2 87963 5 ; 

RA Cheifetz S., Weatherbee J. A., Tsang M.L.S., Anderson J.K., Mole J.E., 

RA Lucas R., Massague J.; 

RT "The transforming growth factor-beta system, a complex pattern of 

RT cross-reactive ligands and receptors."; 

RL Cell 48:409-415(1987). 

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 



CC PROLIFERATION, DIFFERENTIATION , AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES . MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disul fide-1 inked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC -!- CAUTION: REF . 3 SEQUENCE WHICH WAS SAID TO ORIGINATE FROM CHICKEN 
CC WHITE LEGHORN, SEEMS (REF. 4) TO ORIGINATE FROM PIG. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to 1 icense@isb-sib . ch) . 

CC 

DR EMBL; Y00111; CAA68291.1, 

DR EMBL; M23703; AAA64616.1, 

DR EMBL; X12373; CAA30933.1, 

DR EMBL; AF461808; AAL57902.1; -. 

DR PIR; A27512; A27512 . 

DR HSSP; P01137; 1KLA . 

DR InterPro; IPR0024 00; GF_cysknot . 

DR InterPro; IPR003911; TGF_TGFb . 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 

DR PRINTS; PR00438; GFCYSKNOT. 

DR PRINTS; PR01423; TGFBETA. 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM002 04; TGFB; 1. 



DR 


PROSITE; 


PS00250; 


TGF BETA 1; 1. 




KW 


Growth factor; Mitogen; 


Glycoprotein; Signal; Polymorphism. 


FT 


SIGNAL 


1 


23 


POTENTIAL. 




FT 


PROPEP 


24 


278 






FT 


CHAIN 


279 


390 


TRANSFORMING GROWTH 


FACTOR BETA 1. 


FT 


DISULFID 


285 


294 


BY SIMILARITY. 




FT 


DISULFID 


293 


356 


BY SIMILARITY. 




FT 


DISULFID 


322 


387 


BY SIMILARITY. 




FT 


DISULFID 


326 


389 


BY SIMILARITY. 




FT 


DISULFID 


355 


355 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


82 


82 


N-LINKED (GLCNAC. . 


.) (POTENTIAL) 


FT 


CARBOHYD 


136 


136 


N- LINKED (GLCNAC. . 


.) (POTENTIAL) 


FT 


CARBOHYD 


176 


176 


N-LINKED (GLCNAC. . 


.) (POTENTIAL) 


FT 


SITE 


244 


246 


CELL ATTACHMENT SITE 


(POTENTIAL) . 


FT 


VARIANT 


114 


114 


L -> V. 




FT 


CONFLICT 


6 


7 


LR -> PG (IN REF. 3) 




FT 


CONFLICT 


180 


180 


R - > G (IN REF. 3) . 




FT 


CONFLICT 


237 


237 


N -> NA (IN REF. 3) . 




SQ 


SEQUENCE 


390 AA; 


44294 MW; A6E2C3659FC384 E6 CRC64 ; 



Query Match 95.0%; Score 2032.5; DB 1; Length 390; 

Best Local Similarity 95.8%; Pred. No. 1.9e-152; 



Matches 386; Conservative 1; Mismatches 3; Indels 13; Gaps 



1; 



Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

I IIIMIIIIIIIMIIMMIIIMMMIIIIIIIIIIMIMIIMMIIIIIIII 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

I II I I I I I I I I I I I I I I I I II I I I I i I I I I I I I I I I I I I II I I I I I I I I I I I h I I I I I I 
Db 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMLESGNQI 120 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Mill 1 1 II II M iM M Ml 1 1 1 II M ! I M 1 1 M'M IIIIMM! II MM 1 1 M I 

Db 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240 

I II 1 1 1 M M 1 1 II 1 1 1 M 1 II M 1 1 1 1 1 ! 1 1 1 1 1 II II 1 1 1 I MIMMMMIIII 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHCSCDSKDNTLHVEINGFN 240 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 3 00 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIM Illllllll 
Db 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRR ALDTNYCFS 287 

Qy 3 01 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 360 

M I II M M M I II MMI II 1 1 1 1 1 M M IM 1 1 1 IMI III I M.MMM 1 1 III I ! Ml 

Db 2 88 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 347 

Qy 3 61 PGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 4 03 

MIIMIMIIIIIMIIIIIIIMIIIIIIMMIIIIIIII 

Db 348 PGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 390 

RESULT 2 
TGF1_SHEEP 

ID TGF1_SHEEP STANDARD; PRT; 3 90 AA. 

AC P50414; 

DT 01-OCT-1996 (Rel . 34, Created) 

DT 01-OCT-1996 (Rel. 34, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1 . 

OS Ovis aries (Sheep) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Cetart iodactyla ; Ruminant ia; Pecora; Bovoidea; 

OC Bovidae; Caprinae; Ovis. 

OX NCBI_TaxID=994 0; 

RN [1] 

RP SEQUENCE FROM N . A . 

RX MEDLINE=95121932; PubMed=782 18 09 ; 

RA Woodall C.J., McLaren L.J., Watt N.J.; 

RT "Sequence and chromosomal localisation of the gene encoding ovine 

RT latent transforming growth factor-beta 1 . " ; 

RL Gene 150:371-373(1994). 

RN [2] 

RP SEQUENCE OF 281-390 FROM N.A. 

RC STRAIN=Mer ino ; TISSUE=Skin; 

RX MEDLINE=95268698 ; PubMed=774 962 1 ; 

RA Sutton R., Ward W.G., Raphael K.A. , Cam G.R-; 



RT "Growth factor expression in skin during wool follicle development."; 

RL Comp. Biochem. Physiol. HOB : 697-705 (1995) . 

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION , DIFFERENTIATION , AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disulf ide-linked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 

DR EMBL; X76916; CAA54242.1; -. 

DR EMBL; L36038; AAA31526.1; 

DR PIR; 146463; 146463. 

DR HSSP; P01137; 1KLA . 

DR InterPro; IPR0024 00; GF_cysknot . 

DR InterPro; I PRO 03 911; TGFJTGFb. 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFbJJ . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propeptide; 1. 

DR PRINTS; PR00438; GFCYSKNOT . 

DR PRINTS; PR01423; TGFB ETA . 

DR ProDom; PDO 00357; TGFb; 1. 

DR SMART; SM002 04; TGFB; 1. 

DR PROSITE; PS0 025 0; TGF_BETA__1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal. 



FT 


SIGNAL 


1 


23 


POTENTIAL. 


FT 


PROPEP 


24 


278 


POTENTIAL. 


FT 


CHAIN 


279 


390 


TRANSFORMING GROWTH FACTOR BETA 1 . 


FT 


DISULFID 


285 


294 


BY SIMILARITY. 


FT 


DISULFID 


293 


356 


BY SIMILARITY. 


FT 


DISULFID 


322 


387 


BY SIMILARITY. 


FT 


DISULFID 


326 


389 


BY SIMILARITY. 


FT 


DISULFID 


355 


355 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


82 


82 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


136 


136 


N-LINKED (GLCNAC. . .) ( POTENTIAL) 


FT 


CARBOHYD 


176 


176 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


244 


246 


CELL ATTACHMENT SITE (POTENTIAL) . 


SQ 


SEQUENCE 


390 AA; 


44291 MW; 


1C247299484D0E57 CRC64 ; 


Query Match 




90.3%; 


Score 1931.5; DB 1; Length 390; 



Best Local Similarity 90.6%; Pred. No. 1.7e-144; 

Matches 365; Conservative 10; Mismatches 15; Indels 13; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQI LSKLRLA 6 0 

I MIMIIMIIIIIIIhllllll lllllllllllllllll lllllllllllllli 

Db 1 MPPSGLRLLPLLLPLLWLLMLTPGRPVAGLSTCKTI DMELVKRKGI EAIRGQI LSKLRLA 60 



Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


288 


Qy 


361 


Db 


348 



SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADY YAKEVTRVLMVESGNQI 12 0 

'Ml 1 1 M 1 1 1 III I M III I II 1 1 1 1 1 1: I IMIIIIIIIMIIIIIIII hi 

SPPSQGDVPPGPLPEAIIALYNSTRDRVAGESAETEPEPEADYYAKEVTRVLMVEYGNKI 120 

YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

III I : hi Mi I III I II II M M I ' hh I II MM Ml II I I I I I , lh Ml 

YDKMKSSSHS I YMFFNTS ELREAVPEPVLLSRADVRLLRLKLKVEQHVELYQKYSNNSWR 18 0 

YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

M 1 1 1 1 II I II M II hi 1 1 1 Ml 1 1 1 1 II Mr I IIIIMI MM • 

YLSNRLLAPSDSPEWLSFDVTGWRQWLTHREEIEGFRLSAHCSCDSKDNTLQVDINGFS 24 0 



I MM M I II III MM III IIIIIMM Ml III II I' MM 

SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRR ALDTNYCFS 287 

STEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDTQYSKVLALYNQHN 36 0 

MIMI II I II M I II I II I MMIM M III Ml II MM IIIIMI II M I MM, 

STEI<NCCVRQLY I DFRKCLGWKWI HEPKGYHANFCLGPCPYI WSLDTQYSKVLALYNQHN 347 
PGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 4 03 

! : 1 1 i 1 1 1 1 1 1 . 1 M i 1 1 1 1 1 ' 1 1 1 1 1 1 1 1 1 1 1 1 i r i 1 : 1 1 1 : 

PGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 390 



RESULT 3 
TGF1_HUMAN 

ID TGF1_HUMAN STANDARD; PRT; 390 AA. . 

AC P0113 7; Q9UCG4; 

DT 21-JUL-1986 (Rel . 01, Created) 

DT 01-FEB-1991 (Rel. 17, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1 OR TGFB . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=87174845; PubMed=3470709 ; 

RA Derynck R. , Rhee L. , Chen E.Y., van Tilburg A. ; 

RT "Intron-exon structure of the human transforming growth factor-beta 

RT precursor gene."; 

RL Nucleic Acids Res. 15:3188-3189(1987). 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDL INE=85296301; PubMed = 3 8 6194 0; 

RA Derynck R., Jarrett J. A. , Chen E.Y., Eaton D.H., Bell J.R., 

RA Assoian R.K., Roberts A.B., Sporn M.B., Goeddel D.V. ; 

RT "Human transforming growth factor-beta complementary DNA sequence and 

RT expression in normal and transformed cells."; 

RL Nature 316:701-705(1985). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC TISSUE^Duodenum, and Eye; 



RX MEDLINE=22388257; PubMed=124 77932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L. , Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H. , Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F., 

RA Diatchenko L. , Marusina K. , Farmer A. A., Rubin G.M., Hong L. , 

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C. , 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A., 

RA Fahey J. , Helton E . , Ketteman M., Madan A., Rodrigues S., Sanchez A., 

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J . , Schmutz J . , Myers R.M. , 

RA Butterfield Y.S.N. , Krzywinski M.I., Skalska U. , Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [4] 

RP SEQUENCE OF 279-390 FROM N . A . 

RC TISSUE=Carcinoma ; 

RA Urushizaki Y. , Niitsu Y. , Terui T. , Koshida Y. , Mahara K. , Kongo Y. , 

RA Urushizaki I., Takahashi Y. , Ito H. ; 

RT "Cloning and expression of the gene for human transforming growth 

RT factor-beta in Escherichia coli. 11 ; 

RL Tumor Res. 22:41-55(1987). 

RN [5] 

RP SEQUENCE OF 279-329. 

RC TISSUE=Bladder carcinoma; 

RX MEDLINE=93229900; PubMed=8471846 ; 

RA Bourdrel L. , Lin C.-H., Lauren S.L., Elmore R.H., Sugarman B.J., 

RA Hu S., WeStCOtt K.R.; 

RT "Recombinant human transforming growth factor-beta 1: expression by 

RT Chinese hamster ovary cells, isolation, and characterization."; 

RL Protein Expr. Purif. 4:130-140(1993). 

RN [6] 

RP SEQUENCE OF 279-301. 

RX MEDLINE=85131019; PubMed=298282 9 ; 

RA Massague J., Like B.; 

RT "Cellular receptors for type beta transforming growth factor. Ligand 

RT binding and affinity labeling in human and rodent cell lines."; 

RL J. Biol. Chem. 260:2636-2645(1985). 

RN [7] 

RP STRUCTURE BY NMR OF 279-3 90. 

RX MEDLINE=93144319; PubMed=8424 942 ; 

RA Archer S.J., Bax A., Roberts A.B., Sporn M.B., Ogawa Y. , Piez K.A. , 

RA Weatherbee J. A., Tsang M.L.-S., Lucas R. , Zheng B.-L., Wenker J., 

RA Torchia D. A. ; 

RT "Transforming growth factor beta 1: NMR signal assignments of the 

RT recombinant protein expressed and isotopically enriched using Chinese 

RT hamster ovary cells."; 

RL Biochemistry 32:1152-1163(1993). 

RN [8] 

RP STRUCTURE BY NMR OF 2 79-390. 



RX MEDLINE=93144320; PubMed=8424 943 ; 

RA Archer S.J., Bax A. , Roberts A.B., Sporn M.B., Ogawa Y . , Piez K.A. , 

RA Weatherbee J. A. , Tsang M.L.-S., Lucas R. , Zheng B.-L., Wenker J., 

RA Torchia D. A. ; 

RT "Transforming growth factor beta 1: secondary structure as determined 

RT by heteronuclear magnetic resonance spectroscopy."; 

RL Biochemistry 32:1164-1171(1993). 

RN [9] 

RP STRUCTURE BY NMR OF 279-390. 

RX MEDLINE=96266150; PubMed=8679613 ; 

RA Hmck A.P., Archer S.J., Qian S.W., Roberts A.B., Sporn M.B., 

RA Weatherbee J. A., Tsang M.L.-S., Lucas R., Zheng B.-L., Wenker J. , 

RA Torchia D. A. ; 

RT "Transforming growth factor beta 1: three-dimensional structure in 

RT solution and comparison with the X-ray structure of transforming 

RT growth factor beta 2 . " ; 

RL Biochemistry 35:8517-8534(1996). 

CC -!- FUNCTION: MULTIFUNCTIONAL PEPTIDE THAT CONTROLS PROLIFERATION, 
CC DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL TYPES. MANY 

CC CELLS SYNTHESIZE TGF-BETA 1 AND ESSENTIALLY ALL OF THEM HAVE 

CC SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA 1 REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES A 

CC POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disulf ide-linked. 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinformat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; X05839; CAA29283.1; -. 

DR EMBL; X05840; CAA29283.1; JOINED. 

DR EMBL; X05843; CAA29283.1; JOINED. 

DR EMBL; X05844; CAA29283.1; JOINED. 

DR EMBL; X05849; CAA29283.1; JOINED. 

DR EMBL; X05850; CAA29283.1; JOINED. 

DR EMBL; X02812; CAA26580.1; ALT_SEQ. 

DR EMBL; BC001180; AAH01180.1; -. 

DR EMBL; BC000125; AAH00125.1; -. 

DR EMBL; BC022242; AAH22242.1; -. 

DR EMBL; M38449; AAA36735.1; 

DR PIR; A27513; WFHU2 . 

DR PDB; 1KLA; 17-AUG-96. 

DR PDB; 1KLC; 17-AUG-96. 

DR PDB; 1KLD; 17-AUG-96. 

DR Genew; HGNC : 11766 ; TGFB1 . 

DR MIM; 190180; -. 

DR GO; GO: 0006916; P : ant i -apoptos is ; TAS . 

DR InterPro; IPR0024 00; GF_cysknot . 

DR InterPro ; I PRO 039 11; TGF_TGFb . 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N. 



DR 
DR 
DR 
DR 
DR 
DR 
DR 
KW 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
SQ 



Pfam; PF00019; TGF~beta; 1. 
Pfam; PF00688; TGFb_propept ide; 1. 
PRINTS; PR0043 8; GFCYSKNOT . 
PRINTS; PR01423; TGFBETA. 
ProDom; PD000357; TGFb; 1. 
SMART ; SM00204; TGFB; 1. 
PROSITE; PS 002 50; TGF_BETA_1 ; 1. 

Growth factor; Mitogen; Glycoprotein; Signal; 3D-structure . 



SIGNAL 

PROPEP 

CHAIN 

DISULFID 

DISULFID 

DISULFID 

DISULFID 

DISULFID 

CARBOHYD 

CARBOHYD 

CARBOHYD 

SITE 

CONFLICT 

CONFLICT 

STRAND 

TURN 

STRAND 

STRAND 

TURN 

STRAND 

STRAND 

STRAND 

HELIX 

TURN 

STRAND 

TURN 

STRAND 

SEQUENCE 



1 

24 
279 
285 
293 
322 
326 
355 

82 
136 
176 
244 

10 
159 
281 
282 
294 
300 
302 
313 
317 
321 
335 
348 
358 
371 
373 
390 AA; 



23 
278 
390 
294 
356 
387 
389 
355 

82 
136 
176 
246 

10 
159 
281 
287 
296 
300 
305 
313 
317 
323 
346 
349 
370 
372 
387 

44341 MW; 



POTENTIAL. 



TRANSFORMING GROWTH FACTOR BETA 1. 



INTERCHAIN. 

N-LINKED (GLCNAC . . .) (POTENTIAL) 
N- LINKED (GLCNAC. . .) (POTENTIAL) 
N-LINKED (GLCNAC. . .) (POTENTIAL) 
CELL ATTACHMENT SITE (POTENTIAL) . 
L - > P (IN REF. 2) . 
R -> RR (IN REF. 2) . 



75391614250288FE CRC64 ; 



Query Match 89.7%; Score 1918.5; DB 1; Length 390; 

Best Local Similarity 90.6%; Pred. No. 1.8e-143; 

Matches 365; Conservative 10; Mismatches 15; Indels 13; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

I 'III MINI I III I II! Illllil II I M llllll MM II IIIMIII Ml 

Db 1 MP PSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKT I DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

I IMM MM Ml MIMI Mill lllll I M I II 1 1 M 1 1 1 1 1 M 1 1 1 II M hi 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 



Qy 

Db 

Qy 

Db 



121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

INN : IMM MMMIMMMMMMMMMMMMMMMMMhMI 

121 YDKFKQSTHSI YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 18 0 



181 



240 



YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 

IIIMIII Ml MM Ml Mi:- MMMMI I MMMI MIMI 

181 YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEI EGFRLSAHCSCDSRDNTLQVDINGFT 24 0 



Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 300 

: II : 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 Illlll Illllllll 

Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR ALDTNYCFS 287 

Qy 301 STEKNCCWQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 360 

' i i i ' i i i i ; 

Db 288 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 347 

Qy 361 PGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 403 

I II : M I I I ! I I I I I I II I I II I I I I I i II I I I I I I I I I I 
Db 348 PGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 390 

RESULT 4 
TGF1_CERAE 

ID TGF1_CERAE STANDARD; PRT; 3 90 AA. 

AC P09533; 

DT 01-MAR-1989 (Rel . 10, Created) 

DT 01-MAR-1989 (Rel. 10, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1 . 

OS Cercopithecus aethiops (Green monkey) (Grivet) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 

OC Cercopithecinae ; Cercopithecus. 

OX NCBI_TaxID=9534 ; 

RN [ 1 ] 

RP SEQUENCE FROM N . A . 

RX MEDLINE-87246074; PubMed=34 74 13 0 ; 

RA Sharpies K. , Plowman G.D. , Rose T.M. , Twardzik D.R., Purchio A.F.; 

RT "Cloning and sequence analysis of simian transforming growth 

RT factor-beta cDNA . 11 ; 

RL DNA 6:239-244(1987). 

CC -!- FUNCTION: MULTIFUNCTIONAL PEPTIDE THAT CONTROL PROLIFERATION, 
CC DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL TYPES. MANY 

CC CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM HAVE 

CC SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES A 

CC POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disulf ide-lmked. 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as lonq as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

cc 

DR EMBL; M16658; AAA35369.1; -. 

DR PIR; A26960; A26960. 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR003 911; TGF TGFb. 



DR InterPro; IPR001839; TGFb . 

DR InterPro; IPR001111; TGFb_N. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb__propept ide ; 1. 

DR PRINTS; PR00438; GFCYSKNOT . 

DR PRINTS; PR01423; TGFBETA . 

DR ProDom; PD0003 57; TGFb; 1. 

DR SMART ; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal. 

FT SIGNAL 1 16 POTENTIAL. 

FT PROPEP 17 278 

FT CHAIN 279 390 TRANSFORMING GROWTH FACTOR BETA 1. 

FT DISULFID 285 294 BY SIMILARITY. 

FT DISULFID 293 356 BY SIMILARITY. 

FT DISULFID 322 387 BY SIMILARITY. 

FT DISULFID 326 389 BY SIMILARITY. 

FT DISULFID 355 355 INTERCHAIN (BY SIMILARITY) . 

FT CARBOHYD 82 82 N-LINKED ( GLCNAC . . .) ( POTENTIAL) . 

FT CARBOHYD 136 136 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT CARBOHYD 176 176 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT SITE 244 246 CELL ATTACHMENT SITE (POTENTIAL) . 

SQ SEQUENCE 390 AA; 44356 MW; DFF63E2BAB44320E CRC64; 

Query Match 89.6%; Score 1916.5; DB 1; Length 390; 

Best Local Similarity 90.3%; Pred . No. 2.5e-143; 

Matches 364; Conservative 10; Mismatches 16; Indels 13; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 6 0 

I ! I ; I II 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 I II 1 1 1 II 1 1 1 IM I M I II 1 1 Mill III 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPSRPAAGLSTCKTI DMELVKRKRI ETI RGQI LSKLRLA 6 0 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

I I I l h I I II I I I I I I I I I I I I I I I I I I I II I I II I II I INI I I I I I ' I II h hi 
Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

1 1 II I = Ihll M I! 1 1 M I M 1 1 M I Ml I MM II III MM M I II I M M I II 

Db 121 YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

M 1 1 1 1 1 II M 1 1 1 II I MM 1 1 1 1 1 M I MM IM: I Mill MM! 

Db 181 YLSNRLLAPSNSPEWLSFDVTGVVRQWLSRGGEI EGFRLSAHCSCDSKDNTLQVDINGFT 24 0 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 300 

M 1 1 1 1 1 1 II II I II 1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 1 III MINIMI 

Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRR ALDTNYCFS 287 

Qy 3 01 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 360 

IMIMI MMMIMMIMM Mill MMMMMMMMMMMMIIMM 

Db 288 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 347 

Qy 3 61 PGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 4 03 

I 1 1 1 1 1 1 1 1 1 ! 1 1 ! ! 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 , 

Db 348 PGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNM I VRSCKCS 390 



RESULT 5 
TGF1 CAN FA 



ID TGF1_CANFA STANDARD; PRT; 3 90 AA . 

AC P54831; 

DT 01-OCT-1996 (Rel . 34, Created) 

DT 01-OCT-1996 (Rel. 34, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1. 

OS Canis familiaris (Dog) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Carnivora; Fissipedia; Canidae; Canis. 

OX NCBI_TaxID=9615; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE= Jugular vein endothelial; 

RX MEDLINE=95237630; PubMed=772 1 1 1 0 ; 

RA Manning A.M., Auchampach J. A., Drong R.F., Slightom J.L. ; 

RT "Cloning of a canine cDNA homologous to the human transforming growth 

RT factor-beta 1-encodmg gene."; 

RL Gene 155:307-308(1995). 

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE . TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disul fide-1 inked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 

DR EMBL; L34956; AAA51458.1; -. 

DR PIR; JC4023; JC4023. 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR003 911; TGF_TGFb. 

DR InterPro; IPR001839; TGFb . 

DR InterPro; IPR001111; TGFbJtf. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PRO 043 8; GFCYSKNOT . 

DR PRINTS; PR01423; TGFB ETA . 

DR ProDom; PD0 0 03 57; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF__BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal. 

FT SIGNAL 1 23 POTENTIAL. 

FT PROPEP 24 278 BY SIMILARITY. 

FT CHAIN 279 390 TRANSFORMING GROWTH FACTOR BETA 1. 



FT HTCTTT T7TH 






a x b 1 M I iuAK III. 




FT DISULFID 


293 


356 


BY SIMILARITY. 




FT DISULFID 


322 


387 


BY SIMILARITY. 




r\T CT TT CTPl 

ri UloUJ-irlLJ 


T O C 


389 


BY SIMILARITY. 




FT DISULFID 


355 


355 


INTERCHAIN. 




FT CARBOHYD 


82 


82 


N- LINKED (GLCNAC. . . 


) (POTENTIAL) 


FT CARBOHYD 


136 


136 


N-LINKED (GLCNAC . . . 


) (POTENTIAL) 


FT CARBOHYD 


176 


176 


N-LINKED (GLCNAC. . . 


, ) (POTENTIAL) 


FT SITE 


244 


246 


CELL ATTACHMENT SITE 


(POTENTIAL) . 


SQ SEQUENCE 


390 AA; 


44185 MW; EB4 78 0E88B7B590E CRC64 ; 


Query Match 




88.6%; 


Score 1895.5; DB 1; 


Length 3 90; 


Best Local Similarity 


89.6%; 


Pred. No. l.le-141; 




Matches 361; 


Conservative 


12; Mismatches 17; 


Indels 13; 



Qy 


1 


Db 


l 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


288 


Qy 


361 


Db 


348 



Gaps 1 ; 

MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 MM I II II MM II MINI MM MINI II I III III h 

MPPSGLRLLPLLLPLLRLLVLTPGRPAAGLSTCKTIDMELVKRKRI EAIRGQILSKLRLS 60 
SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

MIIIIMM MMMMMMMMMIM 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 h hi 

SPPSQGEVPPVPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVENTNKI 12 0 

YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

hi I - M I : I I 1 I I I I ; I I 1 I I I I I I I I < I I I 1 M I I I I I 1 i j I I I I I I I I I I ! 
YEKVKKSPHSIYMLFNTSELREAVPEPVLLSRAELRLLRLKLKAEQHVELYQKYSNDSWR 18 0 

YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

I I I I I I M I I Ml M M I I I I I I I I I I: : I I I I I : I I I IIIIMI hlllh 

YLSNRLLAPSDTPEWLSFDVTGWRQWLSHGGEVEGFRLSAHCSCDSKDNTLQVDINGFS 24 0 

SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 3 00 

I I M I I I M I I I I I IM I I I I I I I II I II I I I I II Illllllll 
SSRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRQRR ALDTNYCFS 287 

STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 360 

IIIIIIMIIIIIIMIII IMMMMMM MIMMIMI MM IIIMM" 

STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 347 
PGASAAPCCVPQALE PLP I VYYVGRKPKVEQLSNMI VRSCKCS 4 03 

I II 1 1 1 1 1 1 M I II M 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 MM 

PGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 3 90 



RESULT 6 
TGF1_H0RSE 

ID TGFl_HORSE STANDARD; PRT; 3 90 AA. 

AC 019011; 

DT 15-JUL-1998 (Rel . 36, Created) 

DT 15-JUL-1998 (Rel. 36, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1 . 

OS Equus cabal lus (Horse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Perissodactyla ; Equidae; Equus. 

OX NCBI_TaxID=9796; 



RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Lymph node; 

RX MEDLINE=981855 07; PubMed=9524 8 19 ; 

RA Penha-Goncalves M.N., Onions D.E., Nicolson L. ; 

RT "Cloning and sequencing of equine transforming growth factor-beta 1 

RT (TGF beta-1) cDNA . 11 ; 

RL DNA Seq. 7:375-378(1997). 

CC -!- FUNCTION: TGF -BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF -BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF -BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disul fide- 1 inked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; X99438; CAA67801.1; -. 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR0024 00; GF_cysknot . 

DR InterPro ; IPR003 911; TGF_TGFb. 

DR InterPro; IPR001839; TGFb . 

DR InterPro; IPR001111; TGFb_N . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 



DR 


PRINTS; 


PR00438; 


GFCYSKNOT. 




DR 


PRINTS; 


PR01423; 


TGF BETA . 




DR 


ProDom; 


PD000357; 


TGFb; 1. 




DR 


SMART; SM00204; TGFB; 1. 




DR 


PROSITE; 


PS00250; 


TGF BETA 1; 


1. 


KW 


Growth factor; Mitogen; Glycoprotein; Signal. 


FT 


SIGNAL 


1 


23 


POTENTIAL. 


FT 


PROPEP 


24 


278 


BY SIMILARITY. 


FT 


CHAIN 


279 


390 


TRANSFORMING GROWTH FACTOR BETA 1. 


FT 


DISULFID 


285 


294 


BY SIMILARITY. 


FT 


DISULFID 


293 


356 


BY SIMILARITY. 


FT 


DISULFID 


322 


387 


BY SIMILARITY. 


FT 


DISULFID 


326 


389 


BY SIMILARITY. 


FT 


DISULFID 


355 


355 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


82 


82 


N-LINKED (GLCNAC . . .) (POTENTIAL) 


FT 


CARBOHYD 


136 


136 


N- LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


176 


176 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


SQ 


SEQUENCE 


390 AA 


; 43974 MW; 


A86D715F44549691 CRC64 ; 



Query Match 86.1%; Score 1841.5; DB 1; Length 390; 

Best Local Similarity 87.3%; Pred . No. 2e-137; 

Matches 352; Conservative 12; Mismatches 26; Indels 13; Gaps 1; 



Qy 


l 


MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRI EAIRGQILSKLRLA 

i " ii i! ii i i iii ir 'iii'ii i i mi ii i ' i 

MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRI EAIRGQILSKLRLA 


60 


Db 


1 


60 


Qy 


61 


SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 

lllllhllllllll INN III HUM 1 llllllllllll IIMIIl hi 

SPPSQGEVPPGPLPEAVLALYNSTRAQVAGESAETEPEPEADYYAKEVTRVLMVEKENEI 


120 


Db 


61 


120 


Qy 


121 


YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 

1 = MM IIMIIl Ihh II III II IIMIIl II MUM MM hill 
YKTVETGSHS I YMFFNTS ELRAAVPDPMLLSRAELRLLRLKLS VEQH VELYQKYSNNSWR 


180 


Db 


121 


180 


Qy 


181 


YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 

IIMIIl MIIIIMII IIMIIII.:: hllMIMI 1 IIMIIl 1 Mlh 
YLSNRLLTPSDSPEWLSFDVTGWRQWLSQGGAMEGFRLSAHCSCDSKDNTLRVGINGFS 


240 


Db 


181 


240 


Qy 


241 


SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 

1 IIMIIII Ml Mill IIMIIl IIMIIII IIMI Il 

CCDDfnT AT'TnPMMDDPT T T MRTDT PDSnriT UCCDUDD 7\ T nTMVPDC 


300 


UD 


Z 1 


O O "7 

Zo 1 


Qy 


301 


STEKNCCWQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDTQYSKVLALYNQHN 

IMMIMMMMIMIIII II IIIIIMIMMIIIMIMIMIMM llllll 

STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 


360 


Db 


288 


347 


Qy 


361 


PGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 4 03 

1 1 1 II 1 M 1 1 II 1 II II 1 1 1 1 II II M 1 II M M Ml II I ; 

PGASAAPCCVPQVLEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 3 90 




Db 


348 





RESULT 7 
TGF1_M0USE 

ID TGF1_M0USE STANDARD; PRT; 3 90 AA. 

AC P04202; 

DT 20-MAR-1987 (Rel . 04, Created) 

DT 20-MAR-1987 (Rel. 04, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N . A . 

RX MEDLINE=86168129; PubMed=3 0 074 54 ; 

RA Derynck R. , Jarrett J. A. , Chen E.Y., Goeddel D.V. ; 

RT "The murine transforming growth factor-beta precursor."; 

RL J. Biol. Chem. 261:4377-4379(1986). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=BALB/c ; 

RX MEDLINE=960 9654 5; PubMed=85222 00 ; 

RA Guron C. , Sudarshan C. , Raghow R. ; 

RT "Molecular organization of the gene encoding murine transforming 

RT growth factor beta 1 . " ; 

RL Gene 165:325-326(1995). 

RN [3] 



RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6, and NOD/LT; TISSUE=Spleen; 

RA Poirot L., Benoist C. , Mathis D. ; 

RT "Transforming growth factor-beta 1 sequence and expression: no 

RT difference between NOD/Lt and C57B1/6 mouse strains."; 

RL Submitted (AUG- 1998) to the EMBL/GenBank/DDBJ databases. 

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disulf ide-lmked. 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to 1 icense@isb-sib . ch) . 

CC 

DR EMBL; M13177; AAA40423.1; -. 

DR EMBL; L42462; AAB00138.1; -. 

DR EMBL; L42456; AAB00138.1; JOINED. 

DR EMBL; L42457; AAB00138.1; JOINED. 

DR EMBL; L42458; AAB00138.1; JOINED. 

DR EMBL; L42459; AAB00138.1; JOINED. 

DR EMBL; L42460; AAB00138.1; JOINED. 

DR EMBL; L42461; AAB00138.1; JOINED. 

DR EMBL; AJ009862; CAA08900.1; -. 

DR PIR; A01396; WFMS2 . 

DR HSSP; P01137; 1KLA. 

DR MGD; MGI : 98725; Tgf bl . 

DR GO; GO: 0005578; C : extracel lular matrix; IDA. 

DR GO; GO:0006954; P : inflammatory response ; IMP. 

DR GO; GO: 0007515; P: lymph gland development; IMP. 

DR GO; GO: 0008220; P:necrosis; IMP. 

DR GO; GO:0016202; P:regulation of myogenesis; IDA. 

DR InterPro; IPR002400; GF__cysknot . 

DR InterPro; IPR003 911; TGF_TGFb. 

DR InterPro; IPR001839; TGFb . 

DR InterPro; IPR001111; TGFb_N . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR004 38; GFCYSKNOT. 

DR PRINTS; PR01423; TGFB ETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS0 025 0; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal. 

FT SIGNAL 1 23 POTENTIAL. 

FT PROPEP 24 278 

FT CHAIN 279 390 TRANSFORMING GROWTH FACTOR BETA 1. 

FT DISULFID 285 294 BY SIMILARITY. 



FT 


DISULFID 


293 


356 


BY SIMILARITY. 


FT 


DISULFID 


322 


387 


BY SIMILARITY. 


FT 


DISULFID 




Joy 


BY b 1 Ml LiAKl 1 Y . 


FT 


DISULFID 


355 


355 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


82 


82 


N-LINKED ( GLCNAC . . .) (POTENTIAL) 


FT 


CARBOHYD 


136 


136 


N-LINKED (GLCNAC. . . ) (POTENTIAL) 


FT 


CARBOHYD 


176 


176 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


244 


246 


CELL ATTACHMENT SITE (POTENTIAL) . 


SQ 


SEQUENCE 


390 AA; 


44310 


MW; 4381A51B711D689E CRC64 ; 


Query Match 




85. 9% 


Score 1837.5; DB 1; Length 390; 



Best Local Similarity 85.9%; Pred. No. 4.1e-137; 

Matches 346; Conservative 15; Mismatches 29; Indels 13; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

i 'illinium iiiiiiiiiiiimiiiimiiiiiiiiiiiimiiiiiiiiii 

Db 1 MPPSGLRLLPLLLPLPWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAIRGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

1 1 1 1 i ^ 1 1 1 1 1 1 1 1 i 1 1 1 ! 1 1 1 1 1 1 j I •■ II lilliM MMMI 'I I h I I 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESADPEPEPEADYYAKEVTRVLMVDRNNAI 120 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

hi I INI MlhMIMIM lllllllll III ' 
Db 121 YEKTKDISHSIYMFFNTSDIREAVPEPPLLSRAELRLQRLKSSVEQHVELYQKYSNNSWR 180 

Qy 181 YLSNRLLAPSDS PEWLSFDVTGWRQWLTRREAI EGFRLSAHSSSDS KDNTLHVE I NGFN 240 

II MM h h 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : : h ! 1 1 Ml I Mill MMMI 

Db 181 YLGNRLLTPTDTPEWLSFDVTGWRQWLNQGDGIQGFRFSAHCSCDSKDNKLHVEINGIS 240 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 300 

Mill III I I I I I I I I I I I I I I I I I I II I I I I I I lllllllll 
Db 241 PKRRGDLGTIHDMNRPFLLLMATPLERAQHLHSSRHRR ALDTNYCFS 287 

Qy 301 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 360 

I I M 1 1 1 'M M M 1 1 1 M M 1 1 1 1 M 1 1 1 M Ml I M I II Ml Ml M 1 1 1 1 1 1 1 1 1 1 

Db 288 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 347 

Qy 361 PGASAAPCCVPQALE PL P I VY YVGRKPKVEQLSNM I VRSCKCS 4 03 

I I I M 1 1 1 1 1 M II 1 1 1 1 II M I 1 1 1 Ml 1 1 1 1 1 1 1 M III 

Db 348 PGASAS PCCVPQALE PLP I VY YVGRKPKVEQLSNM I VRS CKCS 3 90 



RESULT 8 
TGF1_RAT 

ID TGF1_RAT STANDARD; PRT; 3 90 AA. 

AC P17246; 

DT 01-AUG-1990 (Rel. 15, Created) 

DT 01-AUG-1990 (Rel. 15, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1 . 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCB I_Tax I D= 1 0 1 1 6 ; 
RN [1] 



RP SEQUENCE FROM N.A. 

RC STRAIN=Sprague-Dawley; TISSUE=Heart ; 

RX MEDLINE=90272425; PubMed=2349108 ; 

RA Qian S.W., Kondaiah P., Roberts A.B., Sporn M.B.; 

RT "cDNA cloning by PCR of rat transforming growth factor beta-1."; 

RL Nucleic Acids Res. 18:3059-3059(1990). 

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disulf ide-1 inked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 

DR EMBL; X52498; CAA36741.1; -. 

DR PIR; S10219; S10219. 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR0024 00; GF_cysknot . 

DR InterPro; IPR003911; TGF_TGFb. 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 



DR 


PRINTS; 


PR00438; 


GFCYSKNOT . 




DR 


PRINTS; 


PR01423; 


TGFB ETA . 




DR 


ProDom; 


PD000357; 


TGFb; 1. 




DR 


SMART; SM00204; TGFB; 1. 




DR 


PROSITE; 


PS00250; 


TGF BETA 1; 


1 . 


KW 


Growth factor; Mitogen; Glycoprotein; Signal. 


FT 


SIGNAL 


1 


23 


POTENTIAL. 


FT 


PROPEP 


24 


278 




FT 


CHAIN 


279 


390 


TRANSFORMING GROWTH FACTOR BETA 1. 


FT 


DISULFID 


285 


294 


BY SIMILARITY. 


FT 


DISULFID 


293 


356 


BY SIMILARITY. 


FT 


DISULFID 


322 


387 


BY SIMILARITY. 


FT 


DISULFID 


326 


389 


BY SIMILARITY. 


FT 


DISULFID 


355 


355 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


82 


82 


N-LINKED ( GLCNAC . . .) (POTENTIAL) 


FT 


CARBOHYD 


136 


136 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


176 


176 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


244 


246 


CELL ATTACHMENT SITE (POTENTIAL) . 


SQ 


SEQUENCE 


390 AA 


; 44329 MW; 


5E21108ED50D853C CRC64 ; 



Query Match 85.9%; Score 1837.5; DB 1; Length 390; 

Best Local Similarity 85.9%; Pred. No. 4.1e-137; 

Matches 346; Conservative 14; Mismatches 30; Indels 13; Gaps 



l; 



Qy 


l 


Db 


l 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


UD 




Qy 


301 


Db 


288 


Qy 


361 


Db 


348 



MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 6 0 

I 1 1 1 1 . 1 1 1 ! II 1 1 I II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II I III ! II I L I !l 1 1 1 1 1 1 1 1 1 

MPPSGLRLLPLLLPLPWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 
SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

1 1 II M 1 1 1 1 1 1 M M I II 1 1 Ml 1 1 1 1 1 1 :|||IIIIIIIIIIIMIIIh II 

SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESADPEPEPEADYYAKEVTRVLMVDRNNAI 120 

YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Ml I Ihll Ml • Mill Ml I II | | I I | I I | I I I I M M I II 
YDKTKDITHSI YMFFNTSDIREAVPEPPLLSRAELRLQRFKSTVEQHVELYQKYSNNSWR 18 0 

YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

II 1 1 II MMMMIIIII I III : : hill III I Mill MM = 

YLGNRLLTPTDTPEWLSFDVTGWRQWLNQGDGIQGFRFSAHCSCDSKDNVLHVEINGIS 24 0 
SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 3 00 

Mill III II 1 1 1 M 1 1 II I II 1 1 M M I M I II MINIMI 

PKRRGDLGTIHDMNRPFLLLMATPLERAQHLHSSRHRR ALDTNYCFS 2 87 

STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 360 

1 1 1 1 M 1 1 1 i II 1 . 1 1 M M I 1 1 1 1 ; I . M 1 1 1 1 . 1 1 1 1 1 M II : 1 1 1 1 1 i 1 1 1 1 

STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 347 
PGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 4 03 

Ml M M MM 1 1 1 1 1 1 1 M MMI I II 1 1 1 M 1 1 II 1 1 1 1 III I 

PGASASPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 



RESULT 9 
TGF1_CAVP0 

ID TGF1_CAVP0 STANDARD; PRT; 3 90 AA. 

AC Q9Z1Y6; Q9QZB3 ; Q9R148; 

DT 16-OCT-2001 (Rel . 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1 . 

OS Cavia porcellus (Guinea pig) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Hystricognathi ; Caviidae; Cavia. 

OX NCBI_TaxID=10141; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Hartley; 

RA Jeevan A., McMurray D.N., Yoshimura T. ; 

RT "Guinea pig transforming growth factor-beta in peritoneal exudates 

RT after BCG vaccination."; 

RL Submitted (OCT-1999) to the EMBL/GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE OF 265-382 FROM N.A. 

RX MEDLINE=99144670; PubMed=10025978 ; 

RA Scarozza A.M., Ramsingh A.I., Wiener v., Wicher K.; 

RT "Spontaneous cytokine gene expression in normal guinea pig blood and 

RT tissues . " ; 

RL Cytokine 10:851-859(1998). 

RN [3] 



RP SEQUENCE OF 279-371 FROM N.A. 

RC STRAIN=Hartley; TISSUE=Trachea ; 

RA Morishima Y. , Uchida Y. , Nomura A. , Ishii Y. , Sakamoto T . , 

RA Sekizawa K. ; 

RT "Guinea-pig transforming growth factor-beta expression in injured 

RT tracheal epithelium. 11 ■ 

RL Submitted (JUL-1999) to the EMBL/GenBank/DDBJ databases. 

CC -!- FUNCTION: MULTIFUNCTIONAL PEPTIDE THAT CONTROLS PROLIFERATION, 

CC DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL TYPES . MANY 

CC CELLS SYNTHESIZE TGF-BETA 1 AND ESSENTIALLY ALL OF THEM HAVE 

CC SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA 1 REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES A 

CC POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disul f ide-1 inked (By similarity). 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF191297; AAF02780.1; -. 

DR EMBL; AF097509; AAC83807.1; 

DR EMBL; AF169347; AAD49347.1; -. 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro ; IPR003 911; TGF_TGFb . 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR00438; GFCYSKNOT. 

DR PRINTS; PR01423; TGFB ETA . 

DR ProDom; PD0003 57; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS 002 50; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal. 



FT 


SIGNAL 


1 


24 


POTENTIAL. 


FT 


PROPEP 


25 


278 


POTENTIAL. 


FT 


CHAIN 


279 


390 


TRANSFORMING GROWTH FACTOR BETA 1. 


FT 


DISULFID 


285 


294 


BY SIMILARITY. 


FT 


DISULFID 


293 


356 


BY SIMILARITY. 


FT 


DISULFID 


322 


387 


BY SIMILARITY. 


FT 


DISULFID 


326 


389 


BY SIMILARITY. 


FT 


DISULFID 


355 


355 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


82 


82 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


136 


136 


N- LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


176 


176 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


244 


246 


CELL ATTACHMENT SITE (POTENTIAL) . 


FT 


CONFLICT 


279 


279 


G -> P (IN REF. 3) . 


FT 


CONFLICT 


286 


286 


F - > S (IN REF. 2) . 


FT 


CONFLICT 


309 


309 


K - > E (IN REF. 2) . 


FT 


CONFLICT 


322 


322 


C -> R (IN REF. 2) . 



FT CONFLICT 350 350 A -> G (IN REF . 2) . 

SQ SEQUENCE 390 AA; 44328 MW; 1 53 9F84 9BA0C0FF1 CRC64 ; 



Query Match 85.5%; Score 1828.5; DB 1; Length 390; 

Best Local Similarity 85.6%; Pred. No. 2.1e-136; 

Matches 345; Conservative 15; Mismatches 30; Indels 13; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

I II llllllllllllllll IIIIIMI IIIIIIIIIIIIIIIMII II MINI 

Db 1 MP PSRLRLLPLLLPLLWLLVLAPGRPASGLSTCKT I DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

in MiiiMi iii niiii iiiiiii ii iiiiiii I I II II I I I I I I I - : I 

Db 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEPDYYAKEVTRVLMVDNSHNI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180 

I = Ihll llllllllllhhlllllllh I I I I i I I M I M I I I h III 
Db 121 YKSIETVAHSI YMFFNTSELREAVPDPLLLSRAELRMQRLKLNVEQHVELYQKYSNNSWR 18 0 

Qy 181 YLSNRLLAPSDS PEWLSFDVTGVVRQWLTRREAI EGFRLSAHSSSDSKDNTLH VE I NGFN 240 

Nihil llhlllllllllllllllh: I :|||| Ml | IIIIIII Mill 
Db 181 YLSNQLLTPSDTPEWLSFDVTGWRQWLSQGEELEGFRFSAHCSCDSKDNTLRVEINGIG 240 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 3 00 

i r 1 1 1 i n i ii i r 1 1 1 1 1 iii 1 1 1 1 ii n 1 1 1 1 iiiiiiii 

Db 241 PKRRGDLAAIHGMNRPFLLLMATPLERAQHLHSSRHRR GLDTNYCFS 2 87 

Qy 301 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 360 

I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 288 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 347 

Qy 3 61 PGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 4 03 

1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 ! 1 1 1 1 1 llllllllllllllll 

Db 348 PGASAAPCCVPQALEPLPI VYYVGRKAKVEQLSNM I VRSCKCS 3 90 



RESULT 10 
TGFl_BOVIN 

ID TGFl_BOVIN STANDARD; PRT; 315 AA. 

AC P18341; 

DT 01-NOV-1990 (Rel . 16, Created) 

DT 01-NOV-1990 (Rel. 16, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) (Fragment) . 

GN TGFB1. 

OS Bos taurus (Bovine) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Cetart iodactyla ; Ruminant ia; Pecora ; Bovoidea; 

OC Bovidae; Bovinae; Bos. 

OX NCBI_TaxID=9913 ; 

RN [1] 

RP SEQUENCE FROM N . A . 

RX MEDLINE=9 1042552; PubMed=3 153459 ; 

RA van Obberghen-Schi 1 1 ing E . , Kondaiah P., Ludwig R.L., Sporn M.B., 

RA Baker C. C. ; 

RT "Complementary deoxyribonucleic acid cloning of bovine transforming 

RT growth factor-beta 1 . " ; 



RL Mol. Endocrinol. 1:693-698(1987). 

RN [2] 

RP SUBUNITS. 

RC TISSUE=Bone; 

RX MEDLINE=92129307; PubMed=1733 936 ; 

RA Ogawa Y., Schmidt D.K., Dasch J.R., Chang R.J., Glaser C.B.; 

RT "Purification and characterization of transforming growth factor-beta 

RT 2.3 and -beta 1.2 heterodimers from bovine bone."; 

RL J. Biol. Chem. 267:2325-2328(1992). 

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION , AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disul f ide- 1 inked . Heterodimers of TGF-beta 1/2 

CC have been found in bone. 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 











DR 


EMBL; M36271; AAA30778.1; -. 


DR 


PIR; A40057; A40057. 


DR 


HSSP; P01137; 1KLA . 


DR 


InterPro; 


IPR002400; GF cysknot . 


DR 


InterPro ; 


I PRO 03911; TGF_TGFb. 


DR 


InterPro; 


IPR001839; TGFb . 


DR 


InterPro; 


IPR001111; TGFb N. 


DR 


Pfam; PF00019; TGF-beta; 1. 


DR 


Pfam; PF00688; TGFb_propept ide ; 1. 


DR 


PRINTS; PR00438; 


GFCYSKNOT. 


DR 


PRINTS; PR01423; 


TGFBETA . 


DR 


ProDom; PD000357; 


TGFb; 1. 


DR 


SMART; SM00204; TGFB; 1. 


DR 


PROSITE; 


PS00250; 


TGF BETA 1; 1. 


KW 


Growth factor; Mitogen; Glycoprotein. 


FT 


NON__TER 


1 


1 


FT 


PROPEP 


<1 


203 


FT 


CHAIN 


204 


315 TRANSFORMING GROWTH FACTOR BETA 1. 


FT 


DISULFID 


210 


219 BY SIMILARITY. 


FT 


DISULFID 


218 


281 BY SIMILARITY. 


FT 


DISULFID 


247 


312 BY SIMILARITY. 


FT 


DISULFID 


251 


314 BY SIMILARITY. 


FT 


DISULFID 


280 


280 INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


7 


7 N-LINKED (GLCNAC . . .) (POTENTIAL) 


FT 


CARBOHYD 


61 


61 N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


101 


101 N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


169 


171 CELL ATTACHMENT SITE (POTENTIAL) . 


SQ 


SEQUENCE 


315 AA 


; 36269 MW; C2717A23D994E00E CRC64 ; 



Query Match 73.6%; Score 1574.5; DB 1; Length 315; 

Best Local Similarity 89.9%; Pred. No. 1.5e-116; 

Matches 2 95; Conservative 9; Mismatches 11; Indels 13; Gaps 1; 

Qy 76 AVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQIYDKFKGTPHSLYMLF 135 

hlllllllllllllll I llllllllllllllllllll Ihllll I : Ihll I 
Db 1 AI LALYNSTRDRVAGESAETEPEPEADYYAKEVTR VLMVEYGNKI YDKMKSSSHS I YMFF 6 0 

Qy 13 6 NTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEW 195 

II 1 1 1 M II II I II II MM I III II I II II II 1 1 1 II MM MM I I II 1 1 1 M II 

Db 61 NTSELREAVPEPVLLSRADVRLLRLKLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEW 12 0 

Qy 196 LSFDVTGVVRQWLTRREAI EGFRLSAHSSSDSKDNTLHVEINGFNSGRRGDLATI HGMNR 255 

IIIIIMIIIIIIIIII lllllllll I lllllll hlllhlllllllllllllll 
Db 121 LSFDVTGVVRQWLTRREEI EGFRLSAHCSCDSKDNTLQVDINGFSSGRRGDLATI HGMNR 18 0 

Qy 256 PFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFSSTEKNCCVRQLYIDF 315 

Illllllllllllllllllllll IMIIMIIIII IIIIIMIIII 

Db 181 PFLLLMATPLERAQHLHSSRHRR ALDTNYCFSSTEKNCCVRQLYI DF 227 

Qy 316 RKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALE 375 

1 1 1 1 1 1 1 1 1 1 1 1 1 ; 1 1 1 1 II I II II 1 1 1 II ! 1 1 1 1 1 1 1 1 Ml 1 1 II 1 1 M 1 1 1 1 ; 1 1 1 1 

Db 228 RKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALE 287 

Qy 376 PLPIVYYVGRKPKVEQLSNMIVRSCKCS 403 

II M II II 1 1 1 1 1 M I II 1 1 II I II 1 1 1 

Db 288 PLPIVYYVGRKPKVEQLSNMIVRSCKCS 315 



RESULT 11 
TGF1_CHICK 

ID TGF1JZHICK STANDARD; PRT; 373 AA. 

AC P09531; 

DT 01-MAR-1989 (Rel . 10, Created) 

DT 01-OCT-1996 (Rel. 34, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) (TGF-beta 4) 

DE (Fragment) . 

GN TGFB1 . 

OS Gallus gallus (Chicken) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Archosauria; Aves; Neognathae; Galliformes ; Phasianidae; Phasianinae; 

OC Gallus. 

OX NCBI_TaxID=9031; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=White leghorn; 

RX MEDLINE=89112198; PubMed=2464 13 1 ; 

RA Jakowlew S.B., Dillard P.J., Sporn M.B., Roberts A.B.; 

RT "Complementary deoxyribonucleic acid cloning of a messenger 

RT ribonucleic acid encoding transforming growth factor beta 4 from 

RT chicken embryo chondrocytes."; 

RL Mol . Endocrinol. 2:1186-1195(1988). 

RN [2] 

RP REVISIONS. 

RX MEDLINE=92357039; PubMed=1353860 ; 

RA Burt D.W., Jakowlew S.B.; 



RT "Correction: a new interpretation of a chicken transforming growth 

RT factor-beta 4 complementary DNA . " ; 

RL Mol. Endocrinol. 6:989-992(1992). 

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disulf ide-linked. 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinformat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to 1 icense@isb-sib . ch) . 

CC 

DR EMBL; M31160; AAB05637.1; -. 

DR PIR; A41918; A41918. 

DR HSSP; P01137; 1KLA. 

DR InterPro; I PRO 03 9 11; TGF_TGFb. 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PRO 142 3; TGFB ETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM002 04; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal. 



FT 


NON TER 


1 


1 




FT 


SIGNAL 


<1 


1 


POTENTIAL. 


FT 


PROPEP 


2 


259 


POTENTIAL. 


FT 


CHAIN 


260 


373 


TRANSFORMING GROWTH FACTOR BETA 1. 


FT 


DISULFID 


266 


277 


BY SIMILARITY. 


FT 


DISULFID 


276 


339 


BY SIMILARITY. 


FT 


DISULFID 


305 


370 


BY SIMILARITY. 


FT 


DISULFID 


309 


372 


BY SIMILARITY. 


FT 


DISULFID 


338 


338 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


54 


54 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


109 


109 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


153 


153 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


224 


226 


CELL ATTACHMENT SITE (POTENTIAL) . 


SQ 


SEQUENCE 


373 AA; 


42634 


MW; 9903F3479C8552E5 CRC64 ; 



Query Match 49.0%; Score 1049; DB 1; Length 373; 

Best Local Similarity 53.5%; Pred . No. 3.9e-75; 

Matches 207; Conservative 52; Mismatches 100; Indels 28; Gaps 8; 

Qy 30 LSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVA 89 

1111= =1 = 1 hllllhllllllllll =11 : II llh I MINI:: : 
Db 2 LSTCQRLDLEAAKKKRI EAVRGQILSKLRLTAPPPASETPPRPLPDDVRALYNSTQELLK 61 



Qy 90 GES-VEPEPEPEADYYAKEVTRVLMVESGNQI YDKFKGTPHSLYMLFNTSELREAVPEPV 148 

: : | |: :|:|||: |: | : : : :: ||:: :|| | | | 
Db 62 QRARLRPPPDGPDEYWAKELRRI PMETTWDGAMEHWQPQSHS I FFVFNVSRARRG-GRPT 120 



Qy 14 9 LLSRAELRLLRLKLK VEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGV 2 03 

II MM | || :|||| | | Mill | : : 

Db 121 LLHRAELRMLRQKAAADSAGTEQRLELYQGYGNASWRYLHGRSVRATADDEWLSFDVTDA 18 0 



Qy 204 VRQWLTRREAI EGFRLSAHSSSD SKDNTLHVE I NGFNSGRRGDLATI HGMNR - - PFL 258 

I llh I = 1 = 11 I : = = | II MIM :| =1 | = = 

Db 181 VHQWLSGSELLGVFKLSVHCPCEMGPGHAEEMRISIEGFEQ-QRGDMQSIAKKHRRVPYV 23 9 



Qy 259 LLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCF - - SSTEKNCCVRQLY1 DFR 316 

I II I III MMI II llhlll : MINI I Mill 

Db 24 0 LAMALPAERANELHSARRRR DLDTDYCFG PGTDEKNCCVRPLY I DFR 28 6 

Qy 317 KDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEP 376 

III MIL MINIMUM Ml h 1 1 Ml : l M Ml II Ml Ml Ml 

Db 287 KDLQWKWIHEPKGYMANFCMGPCPYIWSADTQYTKVLALYNQHNPGASAAPCCVPQTLDP 346 



Qy 377 LP I VY Y VGRKPKVEQLSNM I VRS CKCS 403 

IIMIIIII MIMMMIMM 

Db 34 7 LP 1 1 YYVGRNVRVEQLSNMWRACKCS 373 



RESULT 12 
TGF1_XENLA 

ID TGF1_XENLA STANDARD; PRT; 382 AA. 

AC P16176; 

DT 01-APR-1990 (Rel . 14, Created) 

DT 01-APR-1990 (Rel. 14, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) (TGF-beta 5) . 

OS Xenopus laevis (African clawed frog) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Amphibia; Batrachia; Anura; Mesobatrachia ; Pipoidea; Pipidae; 

OC Xenopodinae; Xenopus. 

OX NCBI_TaxID=8355; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=901100 90; PubMed=2295 601 ; 

RA Kondaiah P., Sands M.J., Smith J.M., Fields A., Roberts A.B., 

RA Sporn M.B., Melton D.A. ; 

RT "Identification of a novel transforming growth factor-beta (TGF-beta 

RT 5) mRNA in Xenopus laevis."; 

RL J. Biol. Chem. 265:1089-1093(1990). 

RN [2] 

RP SEQUENCE FROM N.A. 

RA Vempati U.D., Kondaiah P.; 

RL Submitted (AUG-1997) to the EMBL/GenBank/DDB J databases. 

CC -!- FUNCTION: IMPORTANT ROLE IN CERTAIN ASPECTS OF DIFFERENTIATION. 

CC -!- SUBUNIT: Homodimer; disul f ide-lmked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

cc 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformat ics and the EMBL outstation - 



CC the European Bioinf ormat ics Institute. There are no restrictions on its 
CC use by non-profit institutions as long as its content is in no way 
CC modified and this statement is not removed. Usage by and for commercial 
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
CC or send an email to 1 icense@isb-sib . ch) . 

CC 

DR EMBL; J05180; AAA4 9968.1; -. 

DR EMBL; AF009335; AAB64441.1; -. 

DR EMBL; AF009331; AAB64441.1; JOINED . 

DR EMBL; AF009332; AAB64441.1; JOINED. 

DR EMBL; AF009333; AAB64441.1; JOINED. 

DR EMBL; AF009334; AAB64441.1; JOINED. 

DR PIR; A34929; B61036 . 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR0024 00; GF_cysknot . 

DR InterPro; I PRO 03 911; TGF_TGFb . 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR00438; GFCYSKNOT . 

DR PRINTS; PRO 1423; TGFB ETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS 0025 0; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal. 

POTENTIAL. 

TRANSFORMING GROWTH FACTOR BETA 1. 
BY SIMILARITY. 
BY SIMILARITY. 
BY SIMILARITY. 
BY SIMILARITY. 
INTERCHAIN (BY SIMILARITY) . 
N-LINKED (GLCNAC. . .) (POTENTIAL). 
N-LINKED (GLCNAC. . .) (POTENTIAL). 
N-LINKED (GLCNAC. . .) (POTENTIAL). 
CELL ATTACHMENT SITE (POTENTIAL) . 
MW; 1034621C917AAE15 CRC64; 

Score 870.5; DB 1; Length 382; 
Best Local Similarity 46.3%; Pred. No. 4.3e-61; 

Matches 190; Conservative 55; Mismatches 122; Indels 43; Gaps 11; 



FT 


SIGNAL 


1 


21 


FT 


PROPEP 


22 


270 


FT 


CHAIN 


271 


382 


FT 


DISULFID 


277 


286 


FT 


DISULFID 


285 


348 


FT 


DISULFID 


314 


379 


FT 


DISULFID 


318 


381 


FT 


DISULFID 


347 


347 


FT 


CARBOHYD 


73 


73 


FT 


CARBOHYD 


123 


123 


FT 


CARBOHYD 


166 


166 


FT 


SITE 


234 


236 


SQ 


SEQUENCE 


382 AA; 


442 


Query Match 




40 



Qy 
Db 

Qy 

Db 

Qy 
Db 



9 LPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDV 68 

: : I II I M I I Mill :||| |:::|||lllllllll|:| I I 

1 MEVLWMLLvXLVXHLSSLAMSLSTCKAvT)MEEVRKRRIEAIRGQILSKLKLDKTPDV-DS 59 

69 PPGPLPEAVLALYNSTRDRVAGESVEPE PEPEADYYAKEVTRVLMVESGNQI YDK 123 

: I ■ Mill : : - | : I I I I M | || == | 

60 EKMTVPSEAI FLYNSTLEVI REKATREEEHVGHDQNI QDYYAKQVYR FESITELED- 115 

124 FKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKL- - KVEQHVEL YQKYSNDSW - - 179 

I II I Ml I II lllh : : : = | MM II : 

116 HEFKFKFNASHVRENVGMNSLLHHAELRMYKKQTDKNMDQRMELFWKYQENGTTH 170 



Qy 



180 - RYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAH SSSDSKDNTLHVE 235 



Ill : I Ihlllll I :|| I I I I I = =11 
Db 171 SRYLESKYITPVTDDEWMSFDVTKTVNEWLKRAEENEQFGLQPACKCPTPQAKD ID 226 

Qy 236 I NGFNSGRRGDLATI HGM - - NRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKAL 293 

I II Mllh: :|:|:: : | || : ||| :| 
Db 227 I EGF- PALRGDLASLSSKENTKPYLMITSMPAERIDTVTSSRKKR GV 272 

Qy 2 94 DTNYCFSSTEKNCCWQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPY I WSLDTQYSKVL 3 53 

III : Illh llhllllllllllllllll Ihlll lllllhllllllll 
Db 273 GQEYCFGNNGPNCCVKPLYINFRKDLGWKWIHEPKGYEANYCLGNCPYIWSMDTQYSKVL 332 

Qy 354 ALYNQHNPGASAAPCCVPQALEPLP I VYY VGRKPKVEQLSNM I VRSCKCS 403 

'MINIMI : I I I I I lllllhlllll lllllllhllll II 
Db 333 SLYNQNNPGASISPCCVPDVLEPLPIIYYVGRTAKVEQLSNMVWSCNCS 382 



RESULT 13 
TGF3_CHICK 

ID TGF3__CHICK STANDARD; PRT; 412 AA. 

AC P16047; 

DT Ol-APR-1990 (Rel . 14, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 3 precursor (TGF-beta 3) . 

GN TGFB3 . 

OS Gallus gallus (Chicken) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Archosauria; Aves ; Neognathae; Gallif ormes ; Phasianidae; Phasianinae; 

OC Gallus. 

OX NCBI_TaxID=9031; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=89096966; PubMed=3211158 ; 

RA Jakowlew S.B., Dillard P.J., Kondaiah P., Sporn M.B., Roberts A.B.; 

RT "Complementary deoxyribonucleic acid cloning of a novel transforming 

RT growth factor-beta messenger ribonucleic acid from chick embryo 

RT chondrocytes . " ; 

RL Mol. Endocrinol. 2:747-755(1988). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=White leghorn; 

RX MEDLINE=95169270; PubMed=78 6 512 9 ; 

RA Burt D.W., Dey B . R . , Paton I.R., MorriceD.R., Law A.S.; 

RT "The chicken transforming growth factor-beta 3 gene: genomic 

RT structure, transcriptional analysis, and chromosomal location."; 

RL DNA Cell Biol. 14:111-123(1995). 

RN [3] 

RP SEQUENCE OF 1-117 FROM N.A. 

RC STRAIN=White leghorn; TISSUE=Blood; 

RX MEDLINE=92134496; PubMed=184 0616 ; 

RA Burt D.W., Dey B.R., Paton I.R.; 

RT "Comparative analysis of human and chicken transforming growth 

RT factor-beta 2 and -beta 3 promoters."; 

RL J. Mol. Endocrinol. 7:175-183(1991). 

RN [4] 

RP SEQUENCE OF 1-117 FROM N.A. 

RX MEDLINE=930244 87; PubMed=14 06706 ; 



RA Jakowlew S.B., Lechleider R. , Geiser A.G., Kim S.J., 

RA Santa-Coloma T.A. , Cubert J., Sporn M.B., Roberts A.B.; 

RT "Identification and characterization of the chicken transforming 

RT growth factor-beta 3 promoter."; 

RL Mol. Endocrinol. 6:1285-1298(1992). 

CC -!- FUNCTION: INVOLVED IN EMBRYOGENESIS AND CELL DIFFERENTIATION. 

CC -!- SUBUNIT: Homodimer; disul fide-1 inked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to licenseSisb-sib . ch) . 

CC 

DR EMBL; M31154; AAA49089.1; -. 

DR EMBL; X58127; CAA41128.2; -. 

DR EMBL; X60055; CAA41128.2; JOINED . 

DR EMBL; X60091; CAA41128.2; JOINED. 

DR EMBL; X60090; CAA41128.2; JOINED. 

DR EMBL; S46000; AAB23575.1; 

DR PIR; A34939; A34939. 

DR HSSP; P10600; 1TGJ. 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR003911; TGF_TGFb. 

DR InterPro; IPR001839; TGFb . 

DR InterPro; IPR001111; TGFb_N. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR0043 8; GFCYSKNOT. 

DR PRINTS; PRO 1423; TGFBETA. 

DR ProDom; PD00 03 57; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal. 



FT 


SIGNAL 


1 


23 


POTENTIAL. 


FT 


PROPEP 


24 


300 




FT 


CHAIN 


301 


412 


TRANSFORMING GROWTH FACTOR BETA 3 . 


FT 


DISULFID 


307 


316 


BY SIMILARITY. 


FT 


DISULFID 


315 


378 


BY SIMILARITY. 


FT 


DISULFID 


344 


409 


BY SIMILARITY. 


FT 


DISULFID 


348 


411 


BY SIMILARITY. 


FT 


DISULFID 


377 


377 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


74 


74 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


135 


135 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


142 


142 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


261 


263 


CELL ATTACHMENT SITE (POTENTIAL) . 


FT 


CONFLICT 


323 


326 


ELPT -> DFRQ (IN REF . 1) . 


SQ 


SEQUENCE 


412 AA; 


47077 


MW; 1CAB883170069D55 CRC64 ; 



Query Match 3 9.9%; Score 854.5; DB 1; Length 412; 

Best Local Similarity 45.9%; Pred. No. 8.7e-60; 

Matches 195; Conservative 56; Mismatches 117; Indels 57; Gaps 14; 



Qy 15 LLWLLVLTPGRPAAGLSTCKTIDMELVKRKRI EAI RGQILSKLRLASPPSQGDVPPGPLP 74 

h I =h = Ihl hhl MMhMMMMMMI Ml I I : I 

Db 9 LVLLSLLSFATVSLALSSCTTLDLEHI KKKRVEAI RGQILSKLRLTSPPE- -SVGPAHVP 66 

Qy 75 EAVLALYNSTRDRVAGESVEPEPE PEADYYAKEVTRVLMVE SGNQIYDKF 124 

hlhll h : I =| | | I : I ! I I I : : | = : I- 

Db 67 YQILALYNSTRELL- -EEMEEEKEESCSQENTESEYYAKEIHKFDMIQGLPEHNELGI CP 124 

Qy 125 KGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKVEQHVELYQKYSND 177 

II = = : II I I I III hi h : II :|| = l I 

Db 125 KGVTSNVFR - FNVS SAEKNSTNLFRAEFRVLRVPNPSSKRSEQRI ELFQI LRPDEHI 180 

Qy 178 - SWRYLSNRLLAPSDS PEWLSFDVTGWRQWLTRREAI EGFRLSAH SSSDSKD 229 

I I I I I : MINIUM I h I I I M I Ml 

Db 181 AKQR YLSGRNVQTRGS PEWLSFDVTDTVREWLLHRESNLGLE I S I HCPCHTFQPNGD I LE 240 

Qy 23 0 N TLHVE I NGFNSG RRGDLATI HGMNRPFLLLMATPLERAQH- -LHSSRHRR 278 

I I - I M MM : =: I IMI | h I I • I 

Db 241 NLHEVLEIKFKGIDSEDDYGRGDLGRLKKQKDLHNPHLILMMLPPHRLESPTLGGQRKKR 300 

Qy 279 ALDTNDYKDDDDKALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGP 338 

l.ll : hi M Mh I lllhll . I MM II 

Db 301 ALDTNYCFRNLEENCCVRPLYI ELPTDLGWKWVHEPKGYFANFCSGP 34 7 

Qy 33 9 CPYI WSLDTQYSKVLALYNQHN PGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNM I VR 3 98 

Mh I II M M III II -III MM hlllll llllllllhh 

Db 34 8 CPYLRSADTTHSTVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPKVEQLSNMWK 4 07 

Qy 3 99 SCKCS 4 03 

Mill 

Db 4 08 SCKCS 412 



RESULT 14 
TGF3_PIG 

ID TGF3_PIG STANDARD; PRT; 4 09 AA. 

AC P15203; 

DT 01-APR-1990 (Rel . 14, Created) 

DT 01-APR-1990 (Rel. 14, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 3 precursor (TGF-beta 3) . 

GN TGFB3 . 

OS Sus scrofa (Pig) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Cetart iodactyla ; Suina; Suidae; Sus. 

OX NCBI__TaxID=9823 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Ovary; 

RX MEDLINE=8 909112 0; PubMed=32 08 74 6 ; 

RA Derynck R., Lindquist B., Lee A., Wen D. , Tamm J., Graycar J.L., 

RA Rhee L. , Mason A.J., Miller D.A., Coffey R.J., Moses H.L., 

RA Chen E . Y . ; 

RT "A new type of transforming growth factor-beta, TGF-beta 3."; 

RL EMBO J. 7:3737-3743(1988). 

CC -!- FUNCTION: INVOLVED IN EMBRYOGENESIS AND CELL DIFFERENTIATION . 

CC -!- SUBUNIT: Homodimer; disul f ide- 1 inked . 



CC SUBCELLULAR LOCATION : Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license©isb-sib . ch) . 

CC 

DR EMBL; X14150; CAA32363.1; -. 

DR PIR; S01825; S01825 . 

DR HSSP; P10600; 1TGJ. 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; I PRO 03 911; TGF_TGFb . 

DR InterPro; IPR001839; TGFb . 

DR InterPro; IPR001111; TGFb_N. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR00438; GFCYSKNOT. 

DR PRINTS; PR01423; TGFBETA . 

DR ProDom; PD00 03 57; TGFb; 1. 

DR SMART; SM002 04; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal. 

FT SIGNAL 1 18 POTENTIAL. 

FT PROPEP 19 2 97 

FT CHAIN 298 409 TRANSFORMING GROWTH FACTOR BETA 3. 

FT DISULFID 304 313 BY SIMILARITY. 

FT DISULFID 312 375 BY SIMILARITY. 

FT DISULFID 341 406 BY SIMILARITY. 

FT DISULFID 345 408 BY SIMILARITY. 

FT DISULFID 374 374 INTERCHAIN (BY SIMILARITY) . 

FT CARBOHYD 72 72 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT CARBOHYD 133 133 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT CARBOHYD 140 140 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT SITE 259 261 CELL ATTACHMENT SITE (POTENTIAL) . 

SQ SEQUENCE 409 AA; 46814 MW; B4 90 023 5B5CC955E CRC64 ; 

Query Match 39.1%; Score 836; DB 1; Length 409; 

Best Local Similarity 45.0%; Pred . No. 2.5e-58; 

Matches 190; Conservative 58; Mismatches 122; Indels 52; Gaps 14; 

Qy 15 LLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLASPPSQGDVPPGPLP 74 

hi' : :||| hi : : h I h I I I h I I I I I I I I ||| : I 

Db 7 LVVLALLNFATVSLSMSTCTTLDFDHI KRKRVEAI RGQI LSKLRLTSPPDPSML - - ANI P 64 

Qy 75 EAVLALYNSTR DRVAGES VE - - PEPEPEADYYAKEVTRVLMV ESGNQ I YDKFKG 126 

II MINI * h ' : hh III : |: | | : || 
Db 65 TQVLDLYNSTRELLEE VHGERGDDCTQENTESEYYAKE I YKFDM I QGLEEHNDLAVCPKG 124 

Qy 127 TPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKVEQHVELYQKYSND S 178 

|| I : = I Ml hlh : || :||:| | 

Db 125 ITSKIFR-FNVSSVEK NETNLFRAEFRVLRMPNPSSKRSEQRI ELFQI LQPDEHIAK 180 



Qy 



179 WRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAH SSSDSKDN- 230 



Db 



181 QRYI DGKNLPTRGAAEWLSFDVTDTVREWLLRRESNLGLEI S I HCPCHTFQPNGDI LEN1 24 0 



Qy 231 - -TLHVEINGFNS GRRGDLATIHGM- -NRPFLLLMATPLERAQH- -LHSSRHRRALD 281 

: :: I I I I II = : I h I I I : I : I : I = I 

Db 241 QE VME I KFKG VDS EDD PGRGDLGRLKKKKEHS PHL I LMM I P PDRLDN PGLGAQRKKR 297 

Qy 2 82 TNDYKDDDDKALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPY 341 

iiiiiiii : hlllll I ; II I hi I III hi I I I I h ,1 M Mill 

Db 2 98 ALDTNYCFRNLEENCCVRPLYIDFRQDLGWKWVHEPKGYYANFCSGPCPY 34 7 

Qy 342 IWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCK 4 01 

: I II :| II III II lihilllll I I I I hlllll lllllllhhill 

Db 348 LRSADTTHSSVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTAKVEQLSNMWKSCK 4 07 

Qy 4 02 CS 4 03 
II 

Db 408 CS 409 

RESULT 15 
TGF3_MOUSE 

ID TGF3_MOUSE STANDARD; PRT; 410 AA. 

AC P17125; 

DT 01-AUG-1990 (Rel . 15, Created) 

DT 01-AUG-1990 (Rel. 15, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 3 precursor (TGF-beta 3) . 

GN TGFB3 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=90190650; PubMed=262 8 73 0 ; 

RA Miller D.A., Lee A., Matsui Y . , Chen E.Y., Moses H.L., Derynck R . 

RT "Complementary DNA cloning of the murine transforming growth 

RT factor-beta 3 (TGF beta 3) precursor and the comparative expression 

RT of TGF beta 3 and TGF beta 1 messenger RNA in murine embryos and 

RT adult tissues . " ; 

RL Mol . Endocrinol. 3:1926-1934(1989). 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=91000714 ; PubMed=22 06556 ; 

RA Denhez F. , Lafyatis R., Kondaiah P., Roberts A.B., Sporn M.B.; 

RT "Cloning by polymerase chain reaction of a new mouse TGF-beta, 

RT mTGF-beta 3 . " ; 

RL Growth Factors 3:139-146(1990). 

CC -!- FUNCTION: INVOLVED IN EMBRYOGENESIS AND CELL DIFFERENTIATION. 

CC -!- SUBUNIT: Homodimer; disul fide-1 inked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormat ics Institute. There are no restrictions on its 



CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to 1 icense@isb-sib . ch) . 

CC 

DR EMBL; M32745; AAA40422.1; -. 

DR PIR; A41397; A41397. 

DR HSSP; P10600; 1TGJ. 

DR MGD; MG I : 9872 7; Tgf b3 . 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR003 911; TGF_TGFb. 

DR InterPro; IPR001839; TGFb . 

DR InterPro; IPR001111; TGFb_N . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 



DR 


PRINTS; 


PR00438; 


GFCYSKNOT. 




DR 


PRINTS; 


PR01423; 


TGFB ETA . 




DR 


ProDom; 


PD000357; 


TGFb; 1. 




DR 


SMART; SM002 04; TGFB; 1. 




DR 


PROSITE; 


PS00250; 


TGF BETA 1; 


1. 


KW 


Growth factor; Mitogen; Glycoprotein; Signal. 


FT 


SIGNAL 


1 


23 


POTENTIAL. 


FT 


PROPEP 


24 


298 


POTENTIAL. 


FT 


CHAIN 


299 


410 


TRANSFORMING GROWTH FACTOR BETA 3. 


FT 


DISULFID 


305 


314 


BY SIMILARITY. 


FT 


DISULFID 


313 


376 


BY SIMILARITY. 


FT 


DISULFID 


342 


407 


BY SIMILARITY. 


FT 


DISULFID 


346 


409 


BY SIMILARITY. 


FT 


DISULFID 


375 


375 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


72 


72 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


133 


133 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


140 


140 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


259 


261 


CELL ATTACHMENT SITE (POTENTIAL) . 


SQ 


SEQUENCE 


410 AA; 46884 MW; 


250F7048CA432BD6 CRC64 ; 



Query Match 38.9%; Score 832.5; DB 1; Length 410; 

Best Local Similarity 44.7%; Pred. No. 4.7e-58; 

Matches 188; Conservative 58; Mismatches 126; Indels 49; Gaps 13; 
Qy 15 LLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVPPGPLP 74 





Db 



7 LWLALLNLATI SLSLSTCTTLDFGH I KKKRVEAI RGQI LSKLRLTS PPEPSVMT- - HVP 



Qy 



75 EAVLALYNSTR DRVAGESVE- - PEPEPEADYYAKEVTRVLMVE SGNQI YDKFKG 126 



Db 




Qy 



127 TPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKVEQHVELYQKYSND S 178 



Db 



III'- I III I'll' II • I I • I I 

125 ITSKVFR- FNVSSVEK- - -NGTNLFRAEFRVLRVPNPSSKRTEQRI ELFQI LRPDEH I AK 18 0 




Qy 



179 WRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAH SSSDSKDN- 230 



Db 



181 QRY I GGKNLPTRGTAEWLS FDVTDTVR EWLLRRESNLGLE I S I HC PCHTFQPNGD I LENV 24 0 




Qy 



231 



- - TLHVE I NGFNS GRRGDLATIHGM- - -NRPFLLLMATPLERAQHLHSSRHRRALDT 282 



Db 



241 HEVME I KFKGVDNEDDHGRGDLGRLKKQKDHHN PHLI LMM I PPHRLDS PGQGSQRK 296 



Qy 283 NDYKDDDDKALDTNYCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI 342 

HIIIIIII : hlllll . I I I I hll I I lh I I I I I h i I I I Mllh 
Db 2 97 KRALDTNYCFRNLEENCCVRPLYIDFRQDLGWKWVHEPKGYYANFCSGPCPYL 34 9 

Qy 343 WSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKC 4 02 

I II :| II III II llhllllll I I I I hlllll llllllllhhllll 
Db 350 RSADTTHSTVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPKVEQLSNMWKSCKC 409 

Qy 403 S 403 

I 

Db 410 S 410 



Search completed: October 28, 2003, 09:08:41 
Job time : 9.07747 sees 

GenCore version 5.1.6 
Copyright (c) 19 93 - 2 003 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



Searched : 



October 28, 2003, 07:50:55 ; Search time 32.3099 Seconds 

(without alignments) 
3218.683 Million cell updates/sec 

US-10-017-372E-33 
2139 

1 MAPSGLRLLPLLLPLLWLLV GRKPKVEQLSNMI VRSCKCS 4 03 

BLOSUM62 

Gapop 10.0 , Gapext 0 . 5 

830525 seqs, 258052604 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



830525 



Database 



SPTREMBL 23:* 



1 
2 
3 
4 
5 
6 
7 
8 
9 

10 



sp archea:* 
sp_bacteria : * 
sp_f ungi : * 
sp_human : * 
sp_invertebrate : * 
sp_mammal : * 
sp_mhc : * 
sp_organelle : * 
sp_phage : * 
sp_plant : * 



11 


sp_ 


rodent : * 


12 


sp_ 


virus : * 


13 


sp_ 


vertebrate : * 


14 


sp_ 


unclassified: * 


15 


sp_ 


_rvirus : * 


16 


sp_ 


_bacteriap : * 


17 


sp_ 


_archeap: * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 



Result 
No . 


Score 


Query 

Match Length 


DB 


i D 


Descr ipt ion 


i 
i 


1 o 1 / . b 


85 


0 


390 


b 


\l y i uno 


yyuurno equus caoai 


2 


1738 . 5 


81 


3 


368 


1 1 


Ann /i t\ o 
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3 
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39 


9 


379 


13 


Q8 JHF5 
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4 


o b U . b 


39 


8 


379 
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^OdXAO bpdlUo dUId 
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D 
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38 


9 
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1 1 
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r 

6 


O "5 O 
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38 


9 
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1 J 


^ Q OTaT a q 

y y rWAy 
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7 


826 


38 


6 


414 


1 1 


yy l vPb 


Q91vp5 mus musculu 


8 
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37 


9 


382 


13 


uy .544 y 


09344 9 oncorhynchu 


9 


78 0 


36 


5 


376 


13 


yypiyz 


Q9ptC[2 cyprinus ca 


10 


/6b 


35 


8 


399 


1 1 


yy bKb / 


yyerD/ uiesocr iceLU 


1 1 


724 . 5 


33 


9 


362 


1 1 


yyyKi / 


Q99kl7 mus musculu 


12 


693.5 


32 


4 


130 


11 


Q08 7 14 


Q08714 mesocricetu 


13 


678.5 


31 


7 


361 


13 


Q98 8 54 


Q98854 cyprinus ca 


14 


676.5 


31 


6 


124 


r 
D 


yy bNo u 


yybnou cams ramii 


1 5 


638 


29 


8 


112 


6 


Uuz /J U 
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16 


c r\ c 
bub 


28 


3 


255 


1 1 


yy z ill 


(jyziti mus muscuxu 


17 
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27 


4 


127 


6 


Q9TV08 


Q9tv08 canis famil 


T O 

1 8 


575 


26 


9 


224 


1 1 


a o r*r\ r 7 d 

y8Luzy 


Q8 cdz 9 mus musculu 


19 


563.5 


26 


3 


200 


13 


Q90YF1 


Q90yf 1 pleuronecte 


20 


554 


25 


9 


101 


11 


Q9R184 


Q9rl84 meriones un 


21 


461 


21 


6 


179 


13 


Q90YF2 


Q90yf2 pleuronecte 


22 


413 


19 


3 


88 


13 


Q90YF5 


Q90yf5 pleuronecte 


23 


402 


18 


8 


88 


13 


Q90YF7 


Q90yf7 oncorhynchu 


24 


397 


18 


6 


88 


13 


Q90ZE7 


Q90ze7 acipenser b 


25 


393 


18 


4 


87 


13 


042306 


042306 carassius a 


26 


383 


17 


9 


91 


6 


Q9MYZ1 


Q9myzl capra hircu 


27 


376.5 


17 


6 


309 


4 


Q8WV88 


Q8wv88 homo sapien 


28 


373 


17 


4 


86 


6 


Q28241 


Q28241 cervus elap 


29 


358 


16 


7 


81 


6 


Q9N1S3 


Q9nls3 capreolus c 


30 


340.5 


15 


9 


375 


13 


Q8UWD8 


Q8uwd8 columba liv 


31 


327.5 


15 


3 


375 


13 


Q8AVB2 


Q8avb2 coturnix co 


32 


325.5 


15 


2 


375 


13 


Q8UWD7 


Q8uwd7 coturnix ch 


33 


325 . 5 


15 


2 


389 


13 


Q90YY0 


Q90yy0 ictalurus p 


34 


320.5 


15 


0 


375 


13 


Q98SP0 


Q98sp0 gallus gall 


35 


320 


15 


0 


87 


13 


Q8JHB6 


Q8jhb6 scophthalmu 


36 


317.5 


14 


8 


375 


6 


Q9GM97 


Q9gm97 equus cabal 


37 


317.5 


14 


8 


375 


13 


Q8UWE0 


Q8uwe0 anas platyr 


38 


317 


14 


8 


77 


13 


Q90YF8 


Q90yf8 oncorhynchu 


39 


315.5 


14 


7 


375 


13 


Q8UWD9 


Q8uwd9 anser anser 


40 


313.5 


14 


7 


375 


6 


Q8HY52 


Q8hy52 lepus capen 



41 


310.5 


14 . 


. 5 


375 


6 


Q95J86 


Q95j86 macaca fasc 


42 


302 


14 . 


, 1 


50 


6 


Q28240 


Q28240 cervus elap 


43 


301 


14 . 


. 1 


62 


13 


Q90ZJ7 


Q90zj7 anguilla an 


44 


301 


14 . 


. 1 


376 


13 


Q98TB4 


Q98tb4 oreochromis 


45 


300.5 


14 . 


. 0 


375 


6 


Q8WNS6 


Q8wns6 bos taurus 



ALIGNMENTS 



RESULT 1 
Q9TUM8 

ID Q9TUM8 PRELIMINARY ; PRT; 390 AA. 

AC Q9TUM8 ; 

DT 01-MAY-2000 (TrEMBLrel . 13, Created) 

DT 01-MAY-2000 (TrEMBLrel. 13 , Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta 1. 

GN TGFB1. 

OS Equus caballus (Horse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Perissodactyla ; Equidae; Equus. 

OX NCBI_TaxID=9796; 

RN [ 1 ] 

RP SEQUENCE FROM N . A . 

RA Nixon A.J., Brower-Toland B.T., Sandell L.J.; 

RT "Molecular cloning of equine transforming growth factor beta 1 reveals 

RT a divergent nucleotide structure that encodes a novel bioactive 

RT peptide among mammalian species."; 

RL Submitted (AUG-1999) to the EMBL/ GenBank/DDB J databases. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AF175709; AAD49431.1; -. 

DR HSSP; P01137; lKLA. 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb__N . 

DR InterPro; IPR003911; TGF_TGFb. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR00438; GFCYSKNOT. 

DR PRINTS; PR01423 ; TGFB ETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM0 02 04; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

SQ SEQUENCE 390 AA; 43860 MW; 22 0FE4 0DFCCA6 01 6 CRC64 ; 

Query Match 85.0%; Score 1817.5; DB 6; Length 390; 

Best Local Similarity 86.4%; Pred. No. le-152; 

Matches 348; Conservative 12; Mismatches 30; Indels 13; Gaps 1; 

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKR I EAI RGQI LSKLRLA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I 
Db 1 M P PSGLRLLPLLLPLLWLLVLT PGR PAAGLSTCKT I DMELVKRKR I EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

I I I I I h I I I I I I I I I I I I I I I I I I -Mill I I I I I 1 1 1 II I I I I I I I I I I I hi 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRAQVAGESAETEPEPEADYYAKEVTRVLMVEKENEI 120 



Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

I : Ihll II MM llh hi I Ml Ml Mill I I II Mill III hi II 
Db 121 YKTVETGSHSIYMFFNASELRAAVPDPMLLSRAELRLLRLKLSVEQHVELYQKYSNNSWR 18 0 

Qy 181 YLSNRLLAPSDS PEWLSFDVTG WRQWLTRREAI EGFRLSAHSSSDSKDNTLHVE I NGFN 24 0 

MM II M I II II MM !M II I!- IMI MIM lllllll I lllh 

Db 181 YLSNRLLTPSDS PEWLSFDVTG VVRQWLSQGGAMEGLRLSAHCPCDSKDNTLRVGINGFS 24 0 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFS 3 00 

I III MM I MM Ml III II II I III I II I Ml I 1 1 III II I 

Db 241 SSRRGDLATIDGMNRPFLLLMATPLERAQQLHSSRHRR ALDTNYCSS 287 

Qy 3 01 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 3 60 

1 1 1 M II 1 1 1 II II 1 1 1 1 M M M I II 1 1 1 II 1 1 II I M I M M I II II M I II I II 1 1 1 

Db 2 88 STEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVIALYNQHN 34 7 

Qy 361 PGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 4 03 

1 1 1 1 1 1 1 !; 1 1 - lllllllllll IMI MIIIMIIIIII 

Db 348 PGASAAPCCVPQVLEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 



RESULT 2 
Q8R4D9 

ID Q8R4D9 PRELIMINARY; PRT; 368 AA. 

AC Q8R4D9; 

DT 01-JUN-2 002 (TrEMBLrel . 21, Created) 

DT 01-JUN-2002 (TrEMBLrel . 21, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta-1 protein (Fragment) . 

GN TGFB1 . 

OS Sigmodon hispidus (Hispid cotton rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Sigmodontinae; 

OC Sigmodon. 

OX NCB I_TaxI D=4 2 415; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Blanco J.C., Pletneva L.M., Prince G.A.; 

RT "Cotton rat cytokines, chemokines, and interferons."; 

RL Submitted (FEB-2002) to the EMBL/GenBank/DDBJ databases. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AF480858; AAL87199.1; -. 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR001839; TGFb . 

DR InterPro; IPR001111; TGFb_N. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PRO 043 8; GFCYSKNOT. 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM002 04; TGFB; 1. 

DR PROSITE; PS002 50; TGF_BETA_1 ; 1. 

FT NON_TER 1 1 

SQ SEQUENCE 368 AA; 41905 MW; A5C9 12 07B04 68B4A CRC64 ; 



Query Match 81.3%; Score 1738.5; DB 11; Length 368; 

Best Local Similarity 85.3%; Pred. No. 9.8e-146; 



Matches 325; Conservative 16; Mismatches 27; Indels 13; Gaps 1; 



Qy 23 PGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVTALYN 82 

II M 1 1 ! 1 1 1 1 1 I MM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1! Ml I h I II 1 1 i 1 1 :l 1 : 1 1 

Db 1 PGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGEVPPGPLPEAVLALYN 60 

Qy 83 STRDRVAGESVEPEPEPEADYYAKEVTRVTjMVESGNQI YDKFKGTPHSLYMLFNTSELRE 142 

Illlllllll : I I 1 I I I I I I I I I I I I I I I I I : I MM I MMII I M I • - 1 I 
Db 61 STRDRVAGESADPEPEPEADYYAKEVTRVLMVDRNNAI YDKTKDI PHSVYMFFNTSDI RE 120 

Qy 14 3 AVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTG 2 02 

Mill lllllllll I I III II I I M M I M I I I II I i I : I : I ! I I I I I I I I 
Db 121 AVPEPPLLSRAELRLQRFKSNVEQHVELYEKYSNNSWRYLGNRLLSPTDSPEWLSFDVTS 18 0 

Qy 203 WRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFNSGRRGDLATIHGMNRPFLLLMA 262 

llhll • • hill III I I I I I I lllllil : II Ml III Ml II II I II 
Db 181 WRKWLNQGDGI QGFRFSAHCSCDSKDN I LHVE I NG I S PKRRGDLGTI HDMNRPFLLLMA 240 

Qy 263 TPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFSSTEKNCCVRQLY I DFRKDLGWK 322 

1 1 1 1 M 1 1 M I II I II MIIIIIIIIMMI II MMMIIMM 

Db 241 TPLERAQHLHSSRHRR ALDTNYCFSSTEKNCCVRQLY I DFRKDLGWK 28 7 

Qy 323 WIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYY 3 82 

M 1 1 1 1 1 1 1 1 II I II 1 1 1 1 M 1 1 1 1 M M 1 1 1 II 1 1 M I M Ml 1 1 II 1 1 1 1 1 1 1 1 1 1 

Db 288 WIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASASPCCVPQALEPLPIVYY 347 

Qy 383 VGRKPKVEQLSNM I VRS CKCS 403 

I I I I I I I II I I I I I I I II I I I 
Db 34 8 VGRKPKVEQLSNM I VRS CKCS 3 68 



RESULT 3 
Q8JHF5 

ID Q8JHF5 PRELIMINARY; PRT; 379 AA. 

AC Q8JHF5; 

DT 01-OCT-2002 (TrEMBLrel . 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta 1. 

OS Sparus aurata (Gilthead sea bream) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii ; Neopterygii; Teleostei; Euteleostei; Neoteleostei ; 

OC Acanthomorpha; Acanthopterygii ; Percomorpha; Perciformes; Percoidei; 

OC Sparidae; Sparus. 

OX NCB I _Tax I D= 8 1 7 5 ; 

RN [1] 

RP SEQUENCE FROM N . A . 

RA Tafalla C, Aranguren R . , Secombes C.J., Castrillo J.L., Novoa B., 

RA Figueras A. ; 

RT "Molecular characterization of sea bream (Sparus aurata) transforming 

RT growth factor betal." ; 

RL Fish and Shellfish Immunol. 0:0-0(2002). 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AF510084; AAN03842.1; -. 

DR InterPro; IPR0024 00; GF_cysknot . 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N . 



DR InterPrO; IPR003911; TGF_TGFb. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFbjpropeptide; 2. 

DR PRINTS; PRO 04 38; GFCYSKNOT. 

DR PRINTS; PR01423; TGFBETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART ; SM002 04; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

SQ SEQUENCE 379 AA; 43506 MW ; C0C9D3D2FCA2 9C0E CRC64 ; 

Query Match 39.9%; Score 854.5; DB 13; Length 379; 

Best Local Similarity 46.2%; Pred. No. 2.7e-67; 

Matches 187; Conservative 69; Mismatches 108; Indels 41; Gaps 



15; 



Qy 

Db 



12 LLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRL-ASPPSQGDVPP 70 

h | = = := I h :hllllhhhlhlllllll llllilll I II 

3 LVFLMFMVAYTVGK- VSGMSTCKTLDLEMVKKKR I EAI RSQI LSKLRLPTES PQAGD - - E 59 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 



71 GPLPEAVLALYNSTRDRVAGESVEPE PEPEADYYAKE VTRVLMVESGNQI YDKFK 12 5 

= | =:|=||||| = = = : I : = I :| = | III : I = 
60 EEI PSSLLSLYNSTKEMLKEQQTEVQTDIFTEQEEEEYFA TRVHKFNTTNPV 111 



126 



112 



GTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKV- - -EQHVELYQKYSNDSWRYL 

II := I II Ihl == : Ih lllhl :| II Mill I III 

RTPQNMSMSFN I SE I RRS I GDYRLLTTAELRML - 1 KAPT I LDEQRVELYQGLGT - S PR YL 



182 



169 



183 SNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGF--RLSAHSSSDSKDNTLHVEINGFN 24 0 

-I : MIIIMI == II : == I I! I =1 hi 

170 ASRFITNELRDKWLSFDVTETLQNWLKGNDDVQVFQLRLYCDCGRSSDVSTFSFGISGMT 22 9 

241 SGRRGDLATIHGMNR- -PFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYC 298 

= I III I I = h:| hi : II • I I =h • I I I 

230 AG - RGDKAVLDDMTKQPP YI LTMS I PKNVSSHL - TSRKKRSTETKD TC 275 

299 FSSTEKNCCWQLYIDFRKDLGWPCWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQ 358 

: II MM I I I I I I I I I I M II h I lllhhl I III: : MhMIII 

276 TAQTE -TCCVRSLYI DFRKDLGWKWI HKPTRYHANYCMGSCTYI WNAENKYSQI LALYKH 334 



359 



403 



HN PGASAAPCCVPQALE PLP I VYYVGRKPKVEQLSNM I VRS CKCS 

MM M I ! 1 1 lllhl 1 1 1 1 : lllllllllhlllll 

335 HNPGASAQPCCVPQALEPLPILYYVGRQHKVEQLSNMIVKSCKCS 379 



RESULT 4 
Q8AXK8 
ID 



Q8AXK8 PRELIMINARY; PRT; 379 AA . 

AC Q8AXK8 ; 

DT 01-MAR-2003 (TrEMBLrel . 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta 1. 

OS Sparus aurata (Gilthead sea bream) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii ; Neopterygii; Teleostei; Euteleostei; Neoteleostei ; 

OC Acanthomorpha ; Acanthopterygii ; Percomorpha ; Perciformes; Percoidei; 

OC Sparidae; Sparus. 

OX NCBI_TaxID=8175; 



RN [1] 

RP SEQUENCE FROM N.A. 

RA Tafalla C. , Novoa B., Aranguren R . , Figueras A. ; 

RT "Molecular cloning and characterization of sea bream (Sparus aurata) 

RT TGF beta 1 . " ; 

RL Submitted (SEP-2001) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AF424703; AAN76665.1; -. 

SQ SEQUENCE 379 AA; 43485 MW; A3 06EC387F6DBA7C CRC64 ; 



Query Match 39.8%; Score 850.5; DB 13; Length 379; 

Best Local Similarity 46.2%; Pred. No. 6e-67; 

Matches 187; Conservative 68; Mismatches 109; Indels 41; Gaps 15; 

Qy 12 LLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRL-ASPPSQGDVPP 70 

h h = = = I h : I = I I I I I = I : h I h I I I I I I I I I I I I I I I I II 

Db 3 LVFLMFMVAYTVGK-VSGMSTCKTLDLEMVKKKRIEAIRSQILSKLRLPTESPQAGD- -E 59 

Qy 71 GPLPEAVLALYNSTRDRVAGESVEPEPE PEADYYAKEVTRVLMVESGNQI YDKFK 125 

= 1 = = hllllh = : : I : : I -hi Ml = I • 
Db 60 EEI PSSLLSLYNSTKEMLKEQQTEVQTDI FTEXEEEEYFA TRVHKFNTTNPV 111 

Qy 126 GTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKV EQHVELYQKYSNDSWRYL 182 

II = = I II I hi == : 11 = III hi = 1 Illllll I III 

Db 112 RTPQNMSMS FN I SE I RRS I GDYRLLTTAELRML - I KAPT I LDEQRVELYQGLGT - S PRYL 169 

Qy 183 SNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGF- -RLSAHSSSDSKDNTLHVEINGFN 240 

-I : HI II II I - II = == I II I * I hi 

Db 17 0 ASRF I TN ELRDKWLS FDVTETLQNWLKGNDD VQVFQLRLYCDCGRS SDVSTFS FG I SGMT 22 9 

Qy 241 SGRRGDLATIHGMNR- - PFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYC 298 

:| III I : I : 1-1 h I = I I = I I = 1= = I I I 
Db 230 AG-RGDKAVLDDMTKQPPYILTMSI PKNVSSHL-TSRKKRSTETKD TC 275 

Qy 2 99 FSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQ 358 

: II I I I I llllllllllllllhl lllhhl I llh : =11= I I I I 
Db 276 TAQTE-TCCVRSLYIDFRKDLGWKWIHKPTRYHANYCMGSCTYIWNAENKYSQXLALYKH 334 

Qy 3 59 HNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 4 03 

Illllll llllllllllllhllllh IIMIIIIIhlllll 

Db 335 HNPGASAQPCCVPQALEPLPILYYVGRQHKVEQLSNMIVKSCKCS 379 



RESULT 5 
Q91YU7 

ID Q91YU7 PRELIMINARY; PRT; 412 AA . 

AC Q91YU7; 

DT 01-DEC-2001 (TrEMBLrel . 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor, beta 3. 

GN TGFB3 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 



RA Strausberg R.; 

RL Submitted (OCT-2001) to the EMBL/ GenBank/DDB J databases. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; BC014690; AAH14690.1; -. 

DR MGD; MGI: 98727; Tgf b3 . 

DR InterPro; IPR002400; GF__cysknot . 

DR InterPro; IPRO01839; TGFb. 

DR InterPro; IPR001111; TGFb_N . 

DR InterPro; IPR003911; TGFJTGFb. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propeptide ; 1. 

DR PRINTS; PR00438; GFCYSKNOT. 

DR PRINTS; PR01423; TGFB ETA. 

DR ProDom; PD0 003 57; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1; 1. 

SQ SEQUENCE 412 AA; 47144 MW; F3 EB65D04 6DF32AD CRC64 ; 



Query Match 38.9%; Score 832.5; DB 11; Length 412; 

Best Local Similarity 44.7%; Pred. No. 2.7e-65; 

Matches 188; Conservative 58; Mismatches 126; Indels 49; Gaps 13; 



Qy 15 LLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLASPPSQGDVPPGPLP 74 

■ : Mil hi MM Ml MMIMIMM III : M 

Db 9 LWLALLNLATI SLSLSTCTTLDFGH I KKKRVEAI RGQI LSKLRLTSPPEPSVMT - - HVP 66 

Qy 75 EAVLALYNSTR DRVAGESVE- - PEPEPEADYYAKEVTRVLMVE SGNQI YDKFKG 126 

I I 1,1,1 I I : : II I : MMMIM = h= | = : II 

Db 67 YQVLALYNSTRELLEEMHGEREEGCTQETSESEYYAKEIHKFDMIQGLAEHNELAVCPKG 126 

Qy 127 TPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKVEQHVELYQKYSND S 178 

- II I : : I III hlh : II MIM I 

Db 127 ITSKVFR-FNVSSVEK. NGTNLFRAEFR VXRVPNPSSKRTEQRI ELFQILRPDEHIAK 182 

Qy 179 WRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAH SSSDSKDN- 23 0 

lh : I : MINIM Ihll llh I Ml • I M 

Db 183 QRYIGGKNLPTRGTAEWLSFDVTDTVREWLLRRESNLGLEISIHCPCHTFQPNGDILENV 242 

Qy 231 - -TLHVEINGFNS GRRGDLATIHGM NRPFLLLMATPLERAQHLHSSRHRRALDT 282 

: :: I :: MM = ' M M I I I h 
Db 243 HEVMEI KFKGVDNEDDHGRGDLGRLKKQKDHHNPHLI LMMI PPHRLDSPGQGSQRK 298 

Qy 283 NDYKDDDDKALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYI 342 

MM : I M 1 1 1 1 1 1 M I h 1 1 1 1 1 1 M 1 1 1 1 h II 1 1 1 1 II M 

Db 299 KRALDTNYCFRNLEENCCVRPLYIDFRQDLGWKWVHEPKGYYANFCSGPCPYL 351 

Qy 343 WSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKC 402 

I II M II III II .MM MM hlllll MIMMMMMMI 

Db 3 52 RSADTTHST VLGLYNTLN P EASAS PCCVPQDLE PLT I LYY VGRTP KVEQLSNM WKS CKC 411 

Qy 403 S 403 

I 

Db 412 S 412 



RESULT 6 
Q9PWA9 



ID Q9PWA9 PRELIMINARY ; PRT; 382 AA . 

AC Q9PWA9; 

DT 01-MAY-2000 (TrEMBLrel . 13, Created) 

DT 01-MAY-2000 {TrEMBLrel. 13, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel . 23, Last annotation update) 

DE Transforming growth factor beta precursor. 

GN TGF-BETA. 

OS Morone chrysops x Morone saxatilis (white bass x striped bass) . 

OC Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrata; Euteleostomi ; 

OC Act inopterygii ; Neopterygii; Teleostei; Euteleostei; Neoteleostei ; 

OC Acanthomorpha; Acanthopterygii ; Percomorpha; Percif ormes ; Percoidei; 

OC Moronidae; Morone. 

OX NCBI JTaxID = 45352 ; 

RN [1] 

RP SEQUENCE FROM N.A., AND TISSUE SPECIFICITY. 

RC TI SSUE=KI DNEY ; 

RX MEDLINE=20394636; PubMed=10938723 ; 

RA Harms C.A. , Kennedy-Stoskopf S., Horne W.A., Fuller F.J., 

RA Tompkins W.A.F.; 

RT "Cloning and sequencing hybrid striped bass (Morone saxatilis x M . 

RT chrysops) transforming growth factor-beta (TGF-beta) , and development 

RT of a reverse transcription quantitative competitive polymerase chain 

RT reaction (RT-qcPCR) assay to measure TGF-beta mRNA of teleost fish."; 

RL Fish Shellfish Immunol. 10:61-85(2000). 

CC -!- FUNCTION: IS LIKELY TO BE AN IMPORTANT CYTOKINE REGULATING IMMUNE 

CC RESPONSE. MAY ALSO HAVE A ROLE IN OTHER PHYSIOLOGICAL SYSTEMS. 

CC -!- SUBUNIT: HOMODIMER, DI SULF I DE - L I NKED (BY SIMILARITY). 

CC -!- TISSUE SPECIFICITY: HIGHER LEVELS FOUND IN MONONUCLEAR CELLS FROM 

CC PERIPHERAL BLOOD THAN IN SPLEEN OR ANTERIOR KIDNEY. 

CC SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AF140363; AAD46997.1; -. 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR0024 00; GF_cysknot . 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N. 

DR InterPro; IPR003911; TGF_TGFb. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide; 1. 

DR PRINTS; PR00438; GFCYSKNOT. 

DR PRINTS; PR01423; TGFBETA. 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS 002 50; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal. 



FT 


SIGNAL 


1 


7 


POTENTIAL. 


FT 


PROPEP 




270 




FT 


CHAIN 


271 


382 


TRANSFORMING GROWTH FACTOR BETA. 


FT 


DISULFID 


278 


286 


BY SIMILARITY. 


FT 


DISULFID 


285 


348 


BY SIMILARITY. 


FT 


DISULFID 


314 


379 


BY SIMILARITY. 


FT 


DISULFID 


318 


381 


BY SIMILARITY. 


FT 


DISULFID 


347 


347 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


73 


73 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


108 


108 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


113 


113 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


124 


124 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


259 


259 


N-LINKED (GLCNAC. . .) (POTENTIAL) 



FT SITE 23 5 237 CELL ATTACHMENT SITE (POTENTIAL) 

SQ SEQUENCE 382 AA; 43846 MW; 3 124D8C34 EA74D72 CRC64 ; 



Query Match 38.9%; Score 832; DB 13; Length 382; 

Best Local Similarity 45.9%; Pred . No. 2.7e-65; 

Matches 187; Conservative 64; Mismatches 108; Indels 48; Gaps 15; 

Qy 15 LLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRL---ASPPSQGDVPPG 71 

I = = = I I I MMMIMMMIMIIMMI 1 1 II MM I II 

Db 6 LMLVWYTVGN-VSGMSTCKTLDLEMVKKKRIEAIRSQILSKLRLPKEPEPDQAGDEEEI 64 

Qy 72 PLPEAVLALYNSTRDRVAGESVEPE PEPEADYYAKEVTRVLMVESGNQI YDKFKG 126 

II M = IMM = = = : I = I =1 = 11 I I 

Db 65 PTP- -LLSLYNSTKEMLKEQQTEVQTDISTEQEEEEYFAKVLHKFNMTRKNN 114 

Qy 127 TPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKV- -EQHVELYQKYSNDSWRYLSN 184 

I = I M 11 = 11 = 1 : 11= 1 111 = 1 I = II MM II Ml = = 

Db 115 TDTTKKMFFNI SEI RESVGDYRLLTSAELRMLI KKTTI YDEQRVELYSGL-GDSPRYLAS 173 

Qy 185 RLLAPSDSPEWLSFDVTGWRQWLTRREAIEGF--RLSAHSSSDSKDNTL-HVEINGFNS 241 

I = =1111111 == II : =11 II =11= 1 = 1 = 

Db 174 RFI TNKWKDKWLSFDVTKTLQDWLKGTDDEQGFQLRLFCECNKVSAGETI FKFGI SGI DP 233 

Qy 242 GRRGDLATIHGMNR- -PFLLLMATPLERAQHLHS SRHRRALDTNDYKDDDDKALDTN 296 

I III = = = l = = l 1= I M= I M =1= =1 I 

Db 234 G - RGDTGPMQLLTQQP PY I LTMS I P QNISSPSTSRKKRSTETKD 276 

Qy 297 YCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALY 356 

I = II II 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 h I I h 1 1 = I = I I 111= = = 1 1 = = I M I 

Db 277 VCTAQTE-TCCVRSLYIDFRKDLGWKWIHKPTGYNANYCMGSCTYIWNAENKYSQILALY 335 

Qy 357 NQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 4 03 

MM 1 I I I I I I I I I I I I : I I I I h I I I I I I I I 1 I U 1 II I 
Db 336 KHHN PGASAQ PCCVPQALE PLP I LY Y VGRQHKVEQLSNM I VKS CKCS 382 



RESULT 7 
Q91VP5 

ID Q91VP5 PRELIMINARY; PRT; 414 AA. 

AC Q91VP5; 

DT 01-DEC-2001 (TrEMBLrel . 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Similar to transforming growth factor, beta 2. 

GN TGFB2 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Breast tumor; 

RA Strausberg R. ; 

RL Submitted (JUL-2001) to the EMBL/GenBank/DDB J databases. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; BC011170; AAH11170.1; 

DR MGD; MGI: 98 726; Tgf b2 . 



DR InterPro; I PRO 024 00; GF_cysknot . 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb__N . 

DR InterPro; IPR003911; TGF_TGFb . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR004 38; GFCYSKNOT . 

DR PRINTS; PRO 1423; TGFBETA. 

DR ProDOTTl; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1, 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

SQ SEQUENCE 414 AA; 47588 MVJ; DB37A7C3888 1F286 CRC64; 

Query Match 38.6%; Score 826; DB 11; Length 414; 

Best Local Similarity 43.7%; Pred. No. le-64; 

Matches 188; Conservative 63; Mismatches 121; Indels 58; Gaps 15; 
Qy 12 LLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVP-P 70 

= 1 II I I I MM hlh 1 1 1 1 1 1 1 1 1 1 1 M 1 1 : 1 Ml III 

Db 5 VLSTFLLLHLVP- - VALSLSTCSTLDMDQFMRKRI E1AIRGQI LSKLKLTSPPE- -DYPEP 60 

Qy 71 GPLPEAVLALYNSTRD RVAGESVEPEPE - PEADYYAKE VTRVLM VESGNQI YD 122 

•I I : : : I M M I = = : II • ' I I I I I I — I = I I I 

Db 61 DEVPPEVI SI YNSTRDLLQEKASRRAAACERERSDEEYYAKEVYKIDMPSHLPSENAI PP 120 

Qy 123 KFKGTPHSLYMLFNTS ELREAVPEPVLLSRAELRLLRL KLKV- EQHVELYQ K 173 

I h : | : | : : I Ml I II I M II Mill 

Db 121 TF Y - RPYFR I VRFDVSTMEKNASN LVKAEFRVFRLQNPKARVAEQR I ELYQI LKSKD 176 

Qy 174 YSNDSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSS 225 

- : lh : = = = MMMM hMI == IhM I 

Db 177 LTSPTQRYIDSKWKTRAEGEWLSFDVTDAVQEWLHHKDRNLGFKISLHCPCCTFVPSNN 236 

Qy 226 DSKDNTLHVE 1 NGFNSGRRGDLATI HGMNR PFLLLMATPLERAQHLHS 273 

M I 1 = 1 == II M : I MM I I : I 

Db 237 YI I PNKSEELEARFAGIDGTSTYASGDQKTIKSTRKKTSGKTPHLLLMLLPSYRLESQQS 296 

Qy 274 SRHRRALDTNDYKDDDDKALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHAN 333 

II - MM III : : II IM MM MM III I II I I II MM I 

Db 297 SRRKK RALDAAYCFRNVQDNCCLRPLYIDFKRDLGWKWIHEPKGYNAN 344 

Qy 334 FCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLS 393 

II I I MM I II MM I Mill II MM II 1 1 I MM hi hi MM I II 

Db 34 5 FCAGACPYLWSSDTQHTKVLSLYNTINPEASASPCCVSQDLEPLTILYYIGNTPKI EQLS 4 04 

Qy 3 94 NMIVRSCKCS 4 03 

II II M I I M 
Db 4 05 NMIVKSCKCS 414 

RESULT 8 
093449 

ID 093449 PRELIMINARY; PRT; 382 AA. 

AC 093449; Q91217; 

DT 01-NOV-1998 (TrEMBLrel . 08, Created) 

DT 01-NOV-1998 (TrEMBLrel. 08, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 



DE Transforming growth factor beta precursor. 

GN TGF-BETA OR TGF . 

OS Oncorhynchus mykiss (Rainbow trout) (Salmo gairdneri) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii; Neopterygii; Teleostei; Euteleostei; 

OC Protacanthopterygii ; Salmoni formes ; Salmonidae; Oncorhynchus. 

OX NCBI_TaxID=8022 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TI SSUE=LEUKOCYTE ; 

RX MEDLINE=99242020; PubMed=1022748 1 ; 

RA Daniels G.D., Secombes C. J. ; 

RT "Genomic organisation of rainbow trout, Oncorhynchus mykiss TGF- 

RT BETA."; 

RL Dev. Comp. Immunol. 23:139-147(1999). 

RN [2] 

RP SEQUENCE FROM N.A., AND TISSUE SPECIFICITY. 

RC TI SSUE= LEUKOCYTE ; 

RX MEDLINE=98390168; PubMed=9722 928 ; 

RA Hardie L.J., Laing K. J. , Daniels G.D., Grabowski P.S., Cunningham C. , 

RA Secombes C. J. ; 

RT "Isolation of the first piscine transforming growth factor beta gene: 

RT analysis reveals tissue specific expression and a potential regulatory 

RT sequence in rainbow trout (Oncorhynchus mykiss)."; 

RL Cytokine 10:555-563(1998). 

CC -!- FUNCTION: IS LIKELY TO BE AN IMPORTANT CYTOKINE REGULATING IMMUNE 

CC RESPONSE. MAY ALSO HAVE A ROLE IN OTHER PHYSIOLOGICAL SYSTEMS. 

CC -!- SUBUNIT: HOMODIMER , DISULFIDE -LINKED (BY SIMILARITY). 

CC -!- TISSUE SPECIFICITY: EXPRESSED IN BLOOD LEUKOCYTES, KIDNEY 

CC MACROPHAGES, BRAIN, GILL AND SPLEEN BUT NOT IN LIVER. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL ; AJ007836; CAA07707.1; 

DR EMBL; X99303; CAA67685.1; -. 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR0024 00; GF_cysknot . 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N . 

DR InterPro; IPR003911; TGFJTGFb . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide; 1. 

DR PRINTS; PR00438; GFCYSKNOT. 

DR PRINTS; PR01423; TGFB ETA . 

DR ProDom; PD0003 57; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA__1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal. 



FT 


SIGNAL 


1 


20 


POTENTIAL. 


FT 


PROPEP 


21 


270 




FT 


CHAIN 


271 


382 


TRANSFORMING GROWTH FACTOR BETA. 


FT 


DISULFID 


278 


286 


BY SIMILARITY. 


FT 


DISULFID 


285 


348 


BY SIMILARITY. 


FT 


DISULFID 


314 


379 


BY SIMILARITY. 


FT 


DISULFID 


318 


381 


BY SIMILARITY. 


FT 


DISULFID 


347 


347 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


76 


76 


N-LINKED ( GLCNAC . . .) (POTENTIAL) 


FT 


CARBOHYD 


116 


116 


N-LINKED (GLCNAC . . .) (POTENTIAL) 


FT 


CARBOHYD 


125 


125 


N-LINKED (GLCNAC. . . ) (POTENTIAL) 



FT CONFLICT 


237 


237 


N -> D (IN REF . 2) . 


FT CONFLICT 


345 


345 


Q -> H (IN REF . 2) . 


FT CONFLICT 


371 


372 


LS - > VP (IN REF. 2) . 


FT CONFLICT 


377 


377 


K -> M (IN REF . 2) . 


SQ SEQUENCE 


382 AA; 


44136 MW; 


93BD4D3540084B92 CRC64 ; 


Query Match 




37.9%; 


Score 810; DB 13; Length 



Best Local Similarity 46.8%; Pred. No. 2.4e-63; 

Matches 182; Conservative 58; Mismatches 105; Indels 44; Gaps 14; 

Qy 30 LSTCKTIDMELVKRKRIEAIRGQILSKLRLASPP SQGDVPPGPLPEAVLALYNSTRD 86 

:||lh: hi I Ml Mil II I II II Mill I Ml :| =::::|||| : 

Db 23 MSTCKSLDLELVKRKRI EAI RGQI LSKLRLPKEPE I DQEGDTE - - EVPASLMS I YNSTVE 8 0 

Qy 87 RVAGESVE PEPEPEADYYAKEVTRVLMVESGNQI YDKFKGTPHSLYMLFNTSEL 140 

- I I : I II Mil II I M I : I = Ml Ih 

Db 81 -LSEEQVHTYI PSTQDAEEEA- YFAKEVHKFNMKQSENT SKHQI - -LFNMSEM 129 

Qy 141 REAVPEPVLLSRAELRLL RLKLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWL 196 

I : II hi I III I I II MIM | Ml : :: : | = 

Db 130 RS VLGTDRLLSQAELRLL I KNHGLLDDS EQRLELYRGV - GDKAR YLKSHF VS KEWANRWV 188 

Qy 197 SFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFNSGRRGDLATI - -HGMN 254 

Mill : Ml MIM MM I Ih h 

Db 189 SFDVTQTLNEWLQGAGEEQGFQLKLPCDCGKPMEEFRFKISGMNK-LRGNTETLAMKMPS 247 

Qy 255 RPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFSSTEKNCCVRQLYID 314 

M Mlh hi I I III M I : II -I II hi Ml 

Db 248 KPHI LLMSLPVERHSQL - SSRKKRQTTTEE I C-SDKSESCCVRKLYID 293 

Qy 315 FRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQAL 374 

I I MM MM III II 1 1 MM 1 1 Mh = MIMIIII III MM MUM I 

Db 294 FRKDLGWKWIHEPTGYFANYCIGPCTYIWNTENKYSQVLALYKHHNPGASAQPCCVPQVL 353 

Qy 375 EPLPIVYYVGRKPKVEQLSNMIVRSCKCS 403 

I I II h I II I h I II II II I I h II M I 
Db 354 EPLPIIYYVGRQHKVEQLSNMI VKSCRCS 382 



RESULT 9 
Q9PTQ2 

ID Q9PTQ2 PRELIMINARY; PRT; 376 AA . 

AC Q9PTQ2; 

DT 01-MAY-2000 (TrEMBLrel . 13, Created) 

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta precursor. 

OS Cyprinus carpio (Common carp) . 

OC Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii ; Neopterygii; Teleostei; Ostariophysi ; Cyprini formes ; 

OC Cyprinidae; Cyprinus. 

OX NCBI_TaxID=7 962; 

RN [1] 

RP SEQUENCE FROM N . A . 

RA Yin Z. , Kuang J. ; 

RT "Molecular cloning of carp transforming growth factor beta 1."; 

RL Submitted (MAR- 1999) to the EMBL/ GenBank/DDB J databases. 



CC -!- FUNCTION: IS LIKELY TO BE AN IMPORTANT CYTOKINE REGULATING IMMUNE 

CC RESPONSE. MAY ALSO HAVE A ROLE IN OTHER PHYSIOLOGICAL SYSTEMS . 

CC -!- SUBUNIT: HOMODIMER, DI SULF I DE - LI NKED (BY SIMILARITY). 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AF136947; AAF22573.1; -. 

DR HSSP; P01137; 1KLA . 

DR InterPro; IPR001839; TGFb . 

DR InterPro; IPR001111; TGFb_N . 

DR InterPro; I PRO 03 911; TGF_TGFb . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR01423; TGFBETA . 

DR ProDom; PD0 0 03 57; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS002 50; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal. 

FT SIGNAL 1 22 POTENTIAL. 

FT PROPEP 23 264 

FT CHAIN 265 376 TRANSFORMING GROWTH FACTOR BETA. 

FT DISULFID 272 280 BY SIMILARITY. 

FT DISULFID 308 373 BY SIMILARITY. 

FT DISULFID 312 375 BY SIMILARITY. 

FT DISULFID 341 341 INTERCHAIN (BY SIMILARITY) . 

FT CARBOHYD 76 76 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT CARBOHYD 125 125 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT CARBOHYD 167 167 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT SITE 23 0 232 CELL ATTACHMENT SITE (POTENTIAL) . 

SQ SEQUENCE 376 AA; 43329 MW; 7F7FC4DA58B6968 1 CRC64 ; 

Query Match 36.5%; Score 780; DB 13; Length 376; 

Best Local Similarity 44.0%; Pred. No. l.le-60; 

Matches 179; Conservative 65; Mismatches 121; Indels 42; Gaps 15; 

Qy 6 LRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLASPPSQ 65 

= 1= Ml I M ^ I I I I 'h MM lllll II MM I II II h I 

Db 1 MRVESLLLALQCLLGFV- - HYSGALSTCSPLDLELI KRKRI EAI RGQI LSKLRLSKEPEV 58 

Qy 66 GDVPPGP-LPEAVLALYNST RDRVAGESVEPEPEP- EADYYAKEVTR- - VLMVESGN 118 

:| ::::|MI : | M ' * I I M II I M • • • M 

Db 59 DEEKESQNI PAELISVYNSTVELNEEQAAPPEQPKEDPVEEEYYAKEVHKFTIKLMEKNP 118 

Qy 119 QI YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLK- - VEQHVELYQKYSN 176 

Ml : II ::= = : =:|= Mill II Mill I 

Db 119 ---DKF LW FN I TD I SQTLGLNR 1 1 SQVELRLLI TTF PDGS EQRLELYQV I GN 167 

Qy 177 DSWRYLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEI 236 

I Ml M = : MMMM == II I II Mhl : | : | ::| 

Db 168 KS-RYLESRFI - - PNQRKWLSFDVTQTLKDWLQRSEAEQGFQLKMADNCDPQ-KTFQLKI 223 

Qy 237 NGFNSGRRGDLATIH-GMNRPFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDT 2 95 

I Ml h I II MM: ||: :||: || | :| 
Db 224 PGLVL-VRGDTETLAVNMPRPHILVMSLPLDGN---NSSKSRRKRQT ET 268 

Qy 296 NYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVTiAL 355 

: : MM I I I I I I I I I II I II I I I I I M I M I I : M : : M I M I I I 

Db 269 DQVCTDKSDGCCVRSLYIDFRKDLGWKWIHEPSGYYANYCTGSCSFVWTSENKYSQVLAL 328 



Qy 356 YNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKC 4 02 

I lllllll II III I MM llllh I I i I I I I I I I - I I I 
Db 329 YKHHNPGASAQPCRVPQVLNPLPI FYYVGRQHKVEQLSNMI VKTCKC 375 

RESULT 10 
Q9ERB7 

ID Q9ERB7 PRELIMINARY; PRT; 399 AA. 

AC Q9ERB7; 

DT 01-MAR-2001 (TrEMBLrel . 16, Created) 

DT 01-MAR-2001 (TrEMBLrel. 16, Last sequence update) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last annotation update) 

DE Transforming growth factor-beta 2 (Fragment) . 

OS Mesocricetus auratus (Golden hamster) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Cricetinae; 

OC Mesocricetus. 

OX NCBIJTaxID=10036; 

RN [1] 

RP SEQUENCE FROM N . A . 

RA Ramesh G., Kondaiah P., Seshagiri P.B.; 

RT "Differential expression and selective localization of transforming 

RT growth factor-beta isoforms in the hamster uterus during estrous 

RT cycle."; 

RL Submitted (AUG-2000) to the EMBL/ GenBank/DDBJ databases. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL ; AY007214; AAG02247.1; -. 

DR HSSP; P08112; 2TGI . 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N . 

DR InterPro; IPR003911; TGF_TGFb . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PRO 14 23; TGFBETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART ; SM0 02 04; TGFB; 1. 

DR PROSITE; PS0025 0; TGF_BETA_1 ; 1. 

FT NONJTER 1 1 

FT NONJTER 399 3 99 

SQ SEQUENCE 399 AA; 46078 MW; A6FF8E65EAFD5 14 8 CRC64 ; 

Query Match 35.8%; Score 765; DB 11; Length 399; 

Best Local Similarity 42.5%; Pred. No. 2.5e-59; 

Matches 177; Conservative 60; Mismatches 121; Indels 58; Gaps 15; 
Qy 18 LLVLTPGR PAAGLSTCKT I DMELVKRKR I EAI RGQ I LS KLRLAS P PSQGDVP - PG PLP EA 76 

Ml I I MM 1 = 11= MMMMMMMIM Ml Ml M 

Db 4 LLHLVP- -VALSLSTCSTLDMDQFMRKRIEAIRGQILSKLKLTSPPE- -DYPEPDEVPPE 59 

Qy 77 VLALYNSTRD RVAGESVEPEPE - PEADYYAKEVTRVLM VESGNQI YDKFKGTP 128 

I : : : I I I I I I : = = I I : MMMI = ' I I I I I I 

Db 60 VISI YNSTRDLLQEKASRRAAACERERSDEEYYAKEVYKIDMPSHFPSENAI PPTFY-RP 118 

Qy 129 HSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKV- EQHVELYQ KYSNDSW 179 

= = 1= I = = I Ml h II I M II Mill - = 

Db 119 YFR I VRFDVSMMEKNASN LVKAEFRVFRLQNPKARVAEQRI ELYQI LKSKDLTS PTQ 175 



Qy 180 RYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAH SSSDSK 228 

M : : = := I I I I I I I I I : I I - II- I I ' : I 

Db 176 RYIDSKWKTRAEGEWLSFDVTDAVHEWLHHKDRNLGFKISLHCPCCTFVPFNNNIIPNK 235 

Qy 22 9 DNTLHVE INGFNSGRRGDLATIHGMNR PFLLLMATPLERAQHLHSSRHRRA 27 9 

I |:| : III = I MM I I : hi - 

Db 236 SEELEARFAGIDGTSQHSSGHQETIKSTRKKNSGKTPHLLLMLLPSYRLESQQSNRRKK- 294 

Qy 280 LDTNDYKDDDDKALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPC 339 

:MI Ml : : Mhl I I I I I - I I I M I M I I M h I I I I I I 
Db 295 RALDAAYCFRNVQDNCCLRPLYIDFKRDLGWKWIHEPKGYNANFCAGAC 343 

Qy 34 0 PY I WSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNM 3 95 

Ihll llhMlhlll II Mhl Ml MM 1 = 11 = 1 I hi Mill 

Db 344 PYLWSSDTQHTKVLSLYNTINPEASASPCCVSHDLEPLTILYYIGNTPKI EQLSNM 399 

RESULT 11 
Q99K17 

ID Q99K17 PRELIMINARY; PRT; 362 AA. 

AC Q99K17; 

DT 01-JUN-2001 (TrEMBLrel . 17, Created) 

DT 01-JUN-2001 (TrEMBLrel . 17, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Similar to transforming growth factor, beta 3 (Fragment) . 

GN TGFB3 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus . 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N . A . 

RA Strausberg R.; 

RL Submitted (MAR-2001) to the EMBL/GenBank/DDBJ databases. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; BC005513; AAH05513.1; 

DR HSSP; P10600; 1TGJ. 

DR MGD; MGI : 98727; Tgf b3 . 

DR InterPro; IPR0024 00; GF_cysknot . 

DR InterPro; IPR001839; TGFb . 

DR InterPro; IPR001111; TGFb_N . 

DR InterPro; IPR003911; TGFJTGFb. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688 ; TGFbjpropept ide; 1. 

DR PRINTS; PR00438; GFCYSKNOT. 

DR PRINTS; PR01423; TGFB ETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM002 04; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

FT NON_TER 1 1 

SQ SEQUENCE 362 AA; 41486 MW; 08 08E4 6 1 8 0FDAE7 0 CRC64 ; 

Query Match 33.9%; Score 724.5; DB 11; Length 362; 

Best Local Similarity 43.4%; Pred. No. 8.5e-56; 

Matches 164; Conservative 52; Mismatches 113; Indels 49; Gaps 13; 



Qy 



58 RLASPPSQGDVPPGPLPEAVLiALYNSTR DRVAGESVE - - PEPEPEADYYAKEVTRVL 112 



h III = =1 IIIMIMI : : II I = hMlllh = 

Db 2 RVGSPPEPSVMT- -HVPYQVLALYNSTRELLEEMHGEREEGCTQETSESEYYAKEIHKFD 59 

Qy 113 MVE SGNQI YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKVE 165 

h: I- II :: || I = = I III 1 = 11= = I 

Db 60 MIQGLAEHNELAVCPKGITSKVFR-FNVSSVEK NGTNLFRAEFRVLRVPNPSSKRTE 115 

Qy 166 QHVELYQKYSND SWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSA 221 

I =11 = 1 I 11= : I = II I MM I 11 = 11 111= I =1 

Db 116 QRIELFQ1LRPDEHIAKQRYIGGKNLPTRGTAEWLSFDVTDTVREWLLRRESNLGLEISI 175 

Qy 222 H S S SDS KDN TLH VE I NGFNS GRRGDLAT I HGM NR P FLLLMAT PL 265 

I : | :| : :: | :: I I I I = =11 = 11 I 

Db 176 HCPCHTFQPNGDI LENVHEVMEI KFKGVDNEDDHGRGDLGRLKKQKDHHNPHLI LMMI PP 23 5 

Qy 266 ERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIH 32 5 

I |= =11111111 : hlllll lllllhlllllhl 

Db 236 HRLDSPGQGSQRK KRALDTN YCFRNLEENCCVRPLY I DFRQDLGWKWVH 284 

Qy 32 6 E PKGYHANFCLG PCP Y I WSLDTQYS KVLALYNQHNPGASAAPCCVPQALE PLP I VYYVGR 385 

Illlhllll I II 11= I II =1 II III II llhlllill MM hlllll 

Db 285 EPKGYYANFGSGPCPYLRSADTTHSTVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGR 344 

Qy 386 KPKVEQLSNMIVRSCKCS 4 03 

IMIIMIhhlllll 
Db 345 TPKVEQLSNMWKSCKCS 362 

RESULT 12 
Q08714 

ID Q08714 PRELIMINARY; PRT; 130 AA . 

AC Q08714; O70331; 

DT 01-NOV-1996 (TrEMBLrel . 01, Created) 

DT 01-NOV-1996 (TrEMBLrel . 01, Last sequence update) 

DT 01-JUN-2002 (TrEMBLrel. 21, Last annotation update) 

DE Transforming growth factor beta 1 (TGF-beta 1) (Fragment) . 

GN TGFB1 . 

OS Mesocricetus auratus (Golden hamster) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Cricetinae; 

OC Mesocricetus. 

OX NCBI_TaxID=10036; 

RN [1] 

RP SEQUENCE FROM N . A . 

RC STRAIN=LVG (SYR) ; 

RX MEDLINE=93304479; PubMed=83 17544 ; 

RA Wong D.T., Donoff R.B., Yang J., Song B.Z., Matossian K. , Nagura N. , 

RA Elovic A., McBride J., Gallagher G. , Todd R. ; 

RT "Sequential expression of transforming growth factors alpha and beta 1 

RT by eosinophils during cutaneous wound healing in the hamster."; 

RL Am. J. Pathol. 143:130-142(1993), 

RN [2] 

RP SEQUENCE OF 26-115 FROM N.A. 

RC STRAIN=SYRIAN; TI SSUE=SPLEEN ; 

RX MEDLINE=98234044 ; PubMed=9573 1 0 0 ; 

RA Melby P.C., Tryon V.V., Chandrasekar B., Freeman G.L.; 

RT "Cloning of Syrian hamster (Mesocricetus auratus) cytokine cDNAs and 



RT analysis of cytokine mRNA expression in experimental visceral 

RT leishmaniasis . 11 ; 

RL Infect. Immun. 66:2135-2142(1998). 

CC -J- FUNCTION: TGF-BETA 1 IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROLS 
CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA 1 AND ESSENTIALLY ALL OF 

CC THEM HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA 1 

CC REGULATES THE ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND 

CC DETERMINES A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: HOMODIMER, DISULFIDE-LINKED . 

CC -!- SIMILARITY: TO OTHER GROWTH FACTORS OF THE TGF-BETA FAMILY. 

DR EMBL; X60296; CAA42838.1; 

DR EMBL; AF046214; AAC40099.1; -. 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR00183 9; TGFb . 

DR Pfam; PF00019; TGF-beta; 1. 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein. 

FT NON_TER 1 1 

FT PROPEP <1 18 

FT CHAIN 19 130 TRANSFORMING GROWTH FACTOR BETA 1. 

FT DISULFID 25 34 BY SIMILARITY. 

FT DISULFID 33 96 BY SIMILARITY. 

FT DISULFID 66 129 BY SIMILARITY. 

FT DISULFID 95 95 INTERCHAIN (BY SIMILARITY) . 

FT CONFLICT 93 93 G -> S (IN REF . 2) . 

SQ SEQUENCE 130 AA; 14997 MW; 8B4 1DD6CF39CCA77 CRC64 ; 

Query Match 32.4%; Score 693.5; DB 11; Length 130; 

Best Local Similarity 88.8%; Pred. No. l.le-53; 

Matches 127; Conservative 0; Mismatches 3; Indels 13; Gaps 1; 

Qy 261 MATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFSSTEKNCCVRQLYIDFRKDLG 32 0 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 

Db 1 MATPLERAQHLQSSRHRR ALDTNYCFSSTEKNCCVRQLY I DFRKDLG 47 

Qy 321 WKW I HEPKGYHANFCLGPCPYI WSLDTQYSKVLALYNQHNPGASAAPCCVPQALE PLP I V 380 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I llllllllllllll 
Db 4 8 WKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAGPCCVPQALEPLPI V 107 

Qy 381 YYVGRKPKVEQLSNMI VRSCKCS 4 03 

M 1 1 ! II II II 1 1 1 1 1 II ' III 

Db 108 YYVGRKPKVEQLSNMI VRSYKCS 13 0 

RESULT 13 
Q98854 

ID Q98854 PRELIMINARY; PRT; 361 AA . 

AC Q98854; 

DT 01-FEB-1997 (TrEMBLrel . 02, Created) 

DT 01-FEB-1997 (TrEMBLrel. 02, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta 2 (TGF-beta 2) (Fragment) . 

GN TGFB2 , 

OS Cyprinus carpio (Common carp) . 



OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata,- Euteleostomi ; 

OC Actinopterygii ; Neopterygii; Teleostei; Ostariophysi ; Cyprini formes ; 

OC Cyprinidae; Cyprinus. 

OX NCBI_TaxID=7962 / 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TI SSUE=HEART ; 

RX MEDLINE=973543 01; PubMed=92 10595 ; 

RA Sumathy K. , Desai K.V. , Kondaiah P. ; 

RT "Isolation of transforming growth factor-beta2 cDNA from a fish, 

RT Cyprinus carpio by RT-PCR." ; 

RL Gene 191:103-107(1997). 

CC -!- FUNCTION : TGF-BETA 2 HAS SUPPRESSIVE EFFECTS ON I NTERLEUKI N - 2 
CC DEPENDENT T-CELL GROWTH . 

CC -!- SUBUNIT: HOMODIMER , DISULFIDE -LINKED (BY SIMILARITY). 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL ; U66874; AAB62983.1; 

DR HSSP; P08112; 2TGI . 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N. 

DR InterPro; IPR003911; TGF_TGFb . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 



DR 


PRINTS; PR01423; 


TGFB ETA . 




DR 


ProDom; PD0003 57; 


TGFb; 1. 




DR 


SMART; SM002 04; TGFB; 1. 




DR 


PROSITE; 


PS00250; 


TGF BETA 1; 


1 . 


KW 


Growth factor; Mitogen; Glycoprotein. 


FT 


NON_TER 


1 


1 




FT 


PROPEP 


<1 


257 




FT 


CHAIN 


258 


361 


TRANSFORMING GROWTH FACTOR BETA 2. 


FT 


DISULFID 


264 


273 


BY SIMILARITY. 


FT 


DISULFID 


272 


335 


BY SIMILARITY. 


FT 


DISULFID 


334 


334 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


30 


30 


N-LINKED (GLCNAC . . . ) (POTENTIAL) 


FT 


CARBOHYD 


98 


98 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


199 


199 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


NON_TER 


361 


361 




SQ 


SEQUENCE 


361 AA 


; 41931 MW; 


94D930FA970A3FD3 CRC64 ; 



Query Match 31.7%; Score 678.5; DB 13; Length 361; 

Best Local Similarity 40.8%; Pred. No. le-51; 

Matches 155; Conservative 55; Mismatches 115; Indels 55; Gaps 14, 

Qy 52 QILSKLRLASPPSQGDVPPGP- -LPEAVLALYNSTRDRVAGESVEPEPEPE ADYY 104 

III Ihh II == I I = -hllllll : := I I :|| 

Db 1 QILCKLKLSCPP EI YPEPEEVSRDI IAI YNSTRDLLQEKANERAATCERQRTGEEYY 57 

Qy 105 AKEVTRVLM VESGNQI YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL- 160 

MM :: I : I I I h = h I = = I Mill* M 

Db 58 AKEVHKIDMQPFYPAENVI PTKHY-NPYFRRLRFDVSSMEKNASN LVKAELRIFRLQ 113 

Qy 161 - - KLKV- EQHVELYQ KYSNDSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRRE 212 

I M M Mill : Ih =: = = Illlllll I Ml h 

Db 114 NPKARVSEQRIELYQILGHKDLTSPTQRYIDSKWRTRTEGEWLSFDVTEAVSEWLLHRD 173 



QY 



213 AIEGFRLSAHSSS 



DS KDNTLHVE INGFNSG- - RRGDLAT I HGMNR 2 55 



IMM I :| I I : Ml I 

Db 174 RNNGFKISLHCPCCTFVPSNNYI I PNKSEELEARFAGIDDSFVHGGDLKMFKKRRHSGQS 233 

Qy 256 PFLLLMATPLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFSSTEKNCCVRQLYIDF 315 

I MM I I : MM* MM Ml = : MIM lllll 

Db 234 PHLLLMLLPSYRLESQHKS -HRQ KRALDAAFCFRNVQDNCCLRSLYIDF 281 

Qy 316 RKDIi3WKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALE 375 

MIMMMMMMMMI I MIM MIM M Ml M MIMIM I M 

Db 282 KKDLGWKWIHEPKGYNANFCAGACPYLWSADTQHSNILGLYNTINPEASASPCCVSQDLE 341 

Qy 376 PLPIVYYVGRKPKVEQLSNM 3 95 

II IMMM 11 = 111111 

Db 342 PLTILYYIGKTPKIEQLSNM 361 

RESULT 14 
Q95N80 

ID Q95N8 0 PRELIMINARY; PRT; 124 AA. 

AC Q95N8 0; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created) 

DT Ol-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta 1 (Fragment) . 

OS Canis familiaris (Dog) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ■ 

OC Mammalia; Eutheria; Carnivora; Fissipedia; Canidae; Canis. 

OX NCBI_TaxID=9615; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Fonfara S., Groene A., Baumgaertner W. ; 

RT "Sequence of canine transforming growth factor beta 1 mRNA in DH82- 

RT cells."; 

RL Submitted (FEB-2001) to the EMBL/GenBank/DDBJ databases. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AF349538; AAK54072.1; -. 

DR InterPro; IPR001839; TGFb. 

DR Pfam; PF00019; TGF-beta; 1. 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM002 04; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

FT NONJTER 1 1 

FT NONJTER 124 124 

SQ SEQUENCE 124 AA; 14329 MW; 21D185218E5556DB CRC64 ; 

Query Match 31.6%; Score 676.5; DB 6; Length 124; 

Best Local Similarity 89.8%; Pred . No. 3.3e-52; 

Matches 123; Conservative 0; Mismatches 1; Indels 13; Gaps 1; 

Qy 264 PLERAQHLHSSRHRRALDTNDYKDDDDKALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKW 323 

M M I I I I I I M M I I I I I I I I I I I I I I I I I I 1 M I I I I I I I I I I I 

Db 1 PLERAQHLHSSRQRR ALDTNYCFSSTEKNCCVRQLY I DFRKDLGWKW 4 7 

Qy 324 IHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYV 383 

M 1 1 1 1 1 1 1 1 1 1 II M I MIMMMMMMMMI 1 1 M 1 1 M M M M M M M M I 

Db 48 IHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYV 107 



Qy 384 GRKPKVEQLSNM I VRS C 4 00 

1 1 II III Ml 1 , 1 1 1 1 1 

Db 108 GRKPKVEQLSNMI VRSC 124 

RESULT 15 
002730 

ID 002730 PRELIMINARY; PRT; 112 AA . 

AC 002730; 097501; 

DT 01-JUL-1997 (TrEMBLrel . 04, Created) 

DT 01-JUL-1997 (TrEMBLrel. 04, Last sequence update) 

DT 01-JUN-2002 (TrEMBLrel. 21, Last annotation update) 

DE Transforming growth factor beta 1 (TGF-beta 1) (Fragment) . 

GN TGFB1 OR TGF-BETA- 1 . 

OS Oryctolagus cuniculus (Rabbit) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Lagomorpha ; Leporidae; Oryctolagus. 

OX NCBI_TaxID=9986; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Taylor T.K., James E.R., McGonigle S., Yoho E.R.; 

RL Submitted (APR- 1997) to the EMBL/ GenBank/ DDB J databases. 

RN [2] 

RP SEQUENCE OF 2-99 FROM N.A. 

RA Inoue K. , Kawabe Y., Kodama T. ; 

RL Submitted (NOV- 1998) to the EMBL/GenBank/DDBJ databases. 

CC -!- FUNCTION: TGF-BETA 1 IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROLS 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA 1 AND ESSENTIALLY ALL OF 

CC THEM HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA 1 

CC REGULATES THE ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND 

CC DETERMINES A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: HOMODIMER, D I SULF I DE -LI NKED (BY SIMILARITY) . 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AF000133; AAB53806.1; -. 

DR EMBL; AB020217; BAA36950.1; -. 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR001839; TGFb . 

DR Pfam; PF00019; TGF-beta; 1. 

DR PRINTS; PR00438; GFCYSKNOT. 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS0025 0; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein. 



FT 


NONJTER 


1 


1 




FT 


CHAIN 


1 


112 


TRANSFORMING GROWTH FACTOR BETA 1 


FT 


DISULFID 


7 


16 


BY SIMILARITY. 


FT 


DISULFID 


15 


78 


BY SIMILARITY. 


FT 


DISULFID 


44 


109 


BY SIMILARITY. 


FT 


DISULFID 


48 


111 


BY SIMILARITY. 


FT 


DISULFID 


77 


77 


INTERCHAIN (BY SIMILARITY) . 


FT 


CONFLICT 


2 


3 


LD -> FS (IN REF. 2) . 


FT 


CONFLICT 


85 


92 


PLPIVYYV -> ATAHRVTTL (IN REF . 2) 


SQ 


SEQUENCE 


112 AA; 


12795 


MW; 53C5B7D46355A6F3 CRC64 ; 



Query Match 



2 9.8%; Score 63 8; DB 6; Length 112; 



Best Local Similarity 100.0%; Pred. No. 7.3e-49; 

Matches 112; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 292 ALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPY1WSLDTQYSK 351 

I I Ml I I I I I I I I I I I I I ll M : I II , I I I I I ! 1 I I I I I I II I I I I I I I i 1 1 1 1 , J 1 1 1 
Db 1 ALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSK 60 

Qy 352 VLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 403 

1 1 1 ,i 1 1 !i ii 1 1 1 1 1 1 , ii i ii 1 1 1 1 1 1 1 1 1 1 1 ii ,ii mi i ii 1 ii 1 1 : i 

Db 61 VXALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 112 



Search completed: October 28, 2003, 09:12:26 
Job time : 33.3099 sees 



GenCore version 5.1.6 
Copyright (c) 19 93 - 2 003 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 
Run on: 



October 27, 2003, 10:47:27 ; Search time 5276.94 Seconds 

(without al ignments ) 
10489.161 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



Searched: 



US-10-017-372E-34 
1353 

1 gatctggtaccgagatggcg. 



. cgattaaagcggccgcgact 13 53 



I DENT I TY_NUC 

Gapop 10.0 , Gapext 1.0 

2888711 seqs, 20454813386 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



5777422 



Database 



GenEmbl : * 



1 




gb_ba : * 




2 




gb_htg : 




3 




gb in : * 




4 




gb om : * 




5 




gb_ov : * 




6 




gb_pat : 


★ 


7 




gb_ph : * 




8 




gb_pl : * 




9 




gb_pr : * 




10 


gb_ro : 


★ 


11 


gb_sts 




12 


gb_sy : 


★ 


13 


gb_un : 


★ 


14 


gb_vi : 


* 


15 


em_ba : 


★ 


16 


em fun 


. * 


17 


em hum 


: * 


18 


em_in: 


★ 


19 


em_mu : 


* 


20 


em om: 


* 


21 


em_or : 


★ 


22 


em_ov : 


★ 


23 


em_pat 


: * 


24 


em_ph : 




25 


em_pl : 


★ 


26 


em_ro : 


* 


27 


em sts 


; * 



O Q 


ctn liii . 


9 Q 


em vi : * 


J u 


cm iiLy num. 


7 i 
j i 


trill iiLy j_i.iv . 




trill iiLy ULiici . 


33 


em htg mus : * 


34 


em htg pin : * 


35 


em_htg_rod : * 


36 


em_htg_mam : * 


37 


em_htg_vrt : * 


38 


em_sy : * 


39 


em htgo_hum : * 


40 


em_htgo_mus : * 


41 


em_htgo_other : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 
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3206 
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PIGTGFB1A 


M23703 Sus scrofa 
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6 


90 
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AX338213 Sequence 
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GGTGFB1 


X12373 Porcine mRN 
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D0GTGFB1A 
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5 
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X76916 O.aries mRN 
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1780 
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BC000125 Homo sapi 


11 


987 


6 


73 


0 


1780 
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BC001180 
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72 
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E03028 
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1746 


9 


BC022242 


BC022242 Homo sapi 


15 


981 


8 


72 


6 


1560 
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106216 


106216 Sequence 2 


16 


981 


72 


5 


2745 


9 


HSTGFB1 


X02812 Human mRNA 


17 


979 


8 


72 


4 


1560 


6 


108268 


108268 Sequence 2 


18 


962 


4 


71 


1 


1569 


6 


106221 


106221 Sequence 3 


19 
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4 


70 


8 


1561 
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108275 


108275 Sequence 3 


20 


942 


8 


69 


7 
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9 


BT007245 


BT007245 Homo sapi 


21 


942 


8 


69 


7 


1173 


12 


BT007866 


BT007866 Synthetic 


22 


933 


6 


69 


0 


1173 


4 


ECRGFB1 


X99438 E.caballus 


23 


933 


4 


69 


0 


1176 


6 


AX615127 


AX615127 Sequence 


24 


931 


8 


68 


9 


1176 


6 


AX481432 


AX481432 Sequence 


25 


931 


8 


68 


9 


1176 


6 


AX615128 


AX615128 Sequence 


26 


927 


2 


68 


5 


1187 


4 


AF175709 


AF175709 Equus cab 


27 


904 


4 


66 


8 


1597 


10 


AF191297 


AF191297 Cavia por 


28 


876 


64 
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1641 


10 


BC013738 


BC013738 Mus muscu 


29 


862 


2 


63 
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1579 


10 


MUSTGFRNA 


M13177 Mouse trans 


30 


862 


2 


63 


7 


2094 


10 


MMU009862 


AJ009862 Mus muscu 


31 


842 


6 


62 


3 


1585 


10 


RNTGFB1 


X524 98 Rat mRNA fo 


32 


830 


8 


61 


4 


1117 


4 


BOVTGFB 


M36271 Bovine tran 


33 


806 


2 


59 


6 


1125 


10 


AF480858 


AF4 8 0858 Sigmodon 



34 


685 . 


, 4 


50 


. 7 


1675 


6 


103310 


103310 Sequence 1 


35 


657 . 


. 2 


48 


. 6 


1376 


6 


AX528533 


AX528533 Sequence 


36 


655 . 


, 6 


48 


. 5 


1389 


6 


AX528619 


AX52 8 619 Sequence 


37 


594 . 


. 6 


43 


. 9 


1352 


6 


AX528535 


AX528535 Sequence 


38 


589 


. 8 


43 


. 6 


1350 


6 


AX528615 


AX528615 Sequence 


39 


558 


. 8 


41 


. 3 


699 


6 


105434 


105434 Sequence 4 


40 


366. 


.4 


27 


. 1 


489 


6 


AX455100 


AX4 5 5100 Sequence 


41 


357 


. 6 


26 


.4 


650 


6 


AX336646 


AX336646 Sequence 


42 


357 


.6 


26 


. 4 


650 


9 


HUMTGFBA 


M38449 Homo sapien 


43 


357 


. 6 


26 


. 4 


862 


6 


103312 


103312 Sequence 3 


44 


339 


. 2 


25 


. 1 


469 


10 


MATGFB1 


X60296 M.auratus m 


45 


335 


24 


. 8 


1256 


5 


CHKTGFB4 


M31160 Gallus gall 



ALIGNMENTS 



RESULT 1 

PIGTGFB1A 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
PUBMED 
COMMENT 



FEATURES 

source 



gene 
CDS 



Euteleostomi; 
Sus . 



PIGTGFB1A 3206 bp mRNA linear MAM 31-MAR-1995 

Sus scrofa transforming growth factor beta-1 mRNA, complete cds . 
M23703 

M23703 . 1 GI :755044 
transforming growth factor-beta- 1 . 
Sus scrofa (pig) 
Sus scrofa 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Mammalia; Eutheria; Cetart iodactyla ; Suina; Suidae; 
1 (bases 1 to 3206) 

Kondaiah, P. , Van Obberghen -Schilling, E . , Ludwig, R . L . , Dhar,R., 
Sporn,M.B. and Roberts, A. B. 

cDNA cloning of porcine transforming growth factor-beta 1 mRNAs . 

Evidence for alternate splicing and polyadenylat ion 

J. Biol. Chem. 263 (34), 18313-18317 (1988) 

89054010 

2461367 

On Apr 1, 1995 this sequence version replaced gi: 341017. 
Original source text: Sus scrofa (strain miniature swine) cDNA to 
mRNA. 

Location/Qualifiers 
1. .3206 

/organism= n Sus scrofa 11 
/mol_type= "mRNA" 
/strain^ "miniature swine" 
/db_xref ="taxon: 9823" 

/eel l_type= "peripheral blood lymphocyte" 
1. .3206 

/gene="TGF-beta-l" 
906. .2078 
/gene = " TGF - be t a - 1 " 
/ codon_start=l 

/product^" transforming growth factor-beta- 1 " 
/protein__id="AAA64616 .1" 
/db_xref="GI: 755045" 

/translation= " MP PSGLRLLPLLLPLLWLLVLTPGR PAAGLSTCKTI DMELVKRK 
RIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEAD 
YYAKEVTRVLMVESGNQIYDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL 



KLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFR 
LSAHCSCDSKDNTLHVEINGFNSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRH 
RRALDTNYCFSSTEKNCCVRQLYI DFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLiDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS" 
polyA_signal 3186. .3191 

/gene= M TGF-beta-l" 
polyA_site 3206 

/gene="TGF-beta-l" 

BASE COUNT 645 a 1041 c 924 g 596 t 

ORIGIN 

Query Match 91.4%; Score 1236.4; DB 4; Length 3206; 

Best Local Similarity 96.3%; Pred. No. 1.5e-208; 

Matches 1292; Conservative 0; Mismatches 11; Indels 39; Gaps 1; 

C CGAGATGG CGCCTT CGGGG CTG CGG CT CTTG C CG CTG CTG CTG C CGCTG CTGTGG CTG C 6 9 

II III i ' II ,' I 'I ' ! I 

CCCCCATGCCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 960 
TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 129 

I I I ' I I I I I I ' I I! 

TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 1020 

AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 9 
MM I ' I M M II M I I , 

AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 108 0 
TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 9 

I MM Ml IIIIMI I 1 1 IMM IM III MM III II II! IMM 1 1 II II 

TCGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCCGTACTGG 114 0 
CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 09 

II 1 1 M II I Ml 1 1 1 1 1 1 II I M 1 1 1 1 M 1 1 1 II I II II II M I III I II I M I M 1 1 M I 

CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 12 00 
CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 369 

I I M I II 1 1 II I M 1 1 M I M I M I M I II 1 1 M I II M 1 1 1 M 1 1 1 1 1 M I M II I II 

CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 1260 
AAATCTATGATAAATTO^AGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 42 9 

Ml Ml I I MIMII MM III Mill III Ml I Ml II Mill III II IM II II Ml 

AAATCTATGATAAATTCAAGGGCACCCCCCAC^GCTTATATATGCTGTTCAA.CA.CGTCGG 132 0 
AG CTC CGGG AAG CGG TG C CGGAA CCTGTATTG CT CT CTCGGG CAGAG CTG CG CCTG CTG A 4 8 9 

I I I M 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 M 1 1 1 M I II I II M 1 1 1 1 II M M II 1 1 1 Ml 1 1 1 

AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 13 8 0 
GGCTCAAGTTAAAAGTGGAGCAG(^CGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 9 

MMMIMIMM Mill llllll MIMMIMM IIMMMIMIMIMM 

GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATAC(^GAAATA(^GC^ATGATTCCT 144 0 
GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 609 

I i 1 1 1 1 1 . 1 1 1 1 1 1 1 1 ! 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 1500 
ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 669 

I I M 1 1 M I 1 1 1 1 1 1 M I MINI 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 



Qy 


10 


Db 


901 


Qy 


70 


Db 


961 


Qy 


130 


Db 


1021 


Qy 


190 


Db 


1081 


Qy 


250 


Db 


1141 


Qy 


310 


Db 


1201 


Qy 


370 


Db 


1261 


Qy 


430 


Db 


1321 


Qy 


490 


Db 


1381 


Qy 


550 


Db 


1441 


Qy 


610 



Db 15 01 ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 1560 

Qy 67 0 T CAGTG CCCACTGTT C CTGTG ACAG CAAAG ATAACA CA CT C CA CGTGG AAATTAACGGGT 72 9 

IIMIIIIIIIIIIIIIIIIIIillllllllllllllllMIIIIIIIIIIIMIIIII 

Db 1561 TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCC^CGTGGAAATTAACGGGT 1620 

Qy 730 TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 789 

IIIIIIIIIIIIIIIIMIIIilllMIIIIIMIMIIIIIIIIIIIIIIIIIIIIMI 

Db 1621 TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 168 0 

Qy 7 90 TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 9 

I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I 
Db 1681 TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGA- 173 9 

Qy 850 CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 909 

I I I I I I I I I I II I I I I I I I I I I 
Db 174 0 GCCCTGGATACCAACTACTGCT 1761 

Qy 910 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 969 

I I I I II I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 1762 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 1821 

Qy 970 TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 102 9 

I I I I II I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1822 TGGG CTGG AAGTGGATTCATGAA CC CAAGGG CTAC CATG C CAATTT CTG C CTGGGG C CCT 1881 

Qy 103 0 GTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 108 9 

I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I II I I I 
Db 18 82 GTCCCTACATCTGGAGCCTAGACA.CTCAGTACAGCAAGGTCCTGGCTCTGTAGAACCAGC 1941 

Qy 10 90 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 9 

M II M 1 1 1 1 1 1 1 1 1 1 i ! 1 1 ) 1) ) M i ) 1 1 1 1 II 1 1 1 1 1 M 1 1 ) 1 1 i 1 1) 1 1 1 1 1 i 1 1 1 ) 

Db 1942 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 2 001 

Qy 1150 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 12 09 

I I I I II II I I I I I I I I I I I I II I I II II I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I 
Db 2 0 02 TGGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 2 061 

Qy 1210 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 1269 

IIIIIIIIIIIIIIIIMIMIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 2 062 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 2121 

Qy 127 0 CCCACCCCCGCCCGCCTCACCGGGGCTGTATTTAAGGACATCGTGCCCCAAGCCCACTTG 132 9 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIII 

Db 2122 CCCACCCCCGCCCGCCTCACCGGGGCTGTATTTAAGGACATCGTGCCCCAAGCCCACTTG 2181 

Qy 1330 GGATCGATTAAAGCGGCCGCGA 1351 

MM I II I III II I I II 

Db 2182 GGATCGATTAAAGGTGGAGAGA 2203 



RESULT 2 
AX338213 

LOCUS AX338213 1326 bp DNA linear PAT 09-JAN-2002 

DEFINITION Sequence 1 from Patent WO0181404. 

ACCESSION AX338213 

VERSION AX338213.1 GI: 18128750 



KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



CDS 



BASE COUNT 
ORIGIN 



Sus scrofa (pig) 
Sus scrofa 

Eukaryota; Metazoa; Chordata; Crania ta; Vertebrata,- Euteleostomi ; 

Mammalia; Eutheria; Cetart lodactyla ; Suina; Suidae; Sus. 

1 

Strober,W., Nakamura,K., Kitani,A. and Fuss, I. J. 

Inducible plasmid vector encoding tgf-_g(b) and uses thereof 

Patent: WO 0181404-A 1 01-NOV-2001; 

THE SECRETARY OF THE DEPARTMENT OF HEALTH AND HUMAN SERVICES (US) 
Location/Qualifiers 
1. .1326 

/organism="Sus scrofa" 
/mol_type= "genomic DNA" 
/db_xref ="taxon: 9823" 
16. .1188 

/note=" unnamed protein product" 
/ codon_start=l 
/protem_id="CAD20538 .1" 
/db_xref ="GI : 18128751" 

/ 1 rans 1 a t ion= " MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRK 
RIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEAD 
YYAKEVTRVLMVESGNQIYDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFR 
LSAHSSSDSKDNTLHVEINGFNSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS" 
263 a 438 c 392 g 233 t 



Query Match 90 . 5%; 

Best Local Similarity 96.3%; 
Matches 1301; Conservative 



Score 1224 .6; DB 6; 
Pred. No. 2e-206; 
0; Mismatches 9; 



Length 1326; 
Indels 41; Gaps 



3; 



Qy 


1 


Db 


2 


Qy 


61 


Db 


62 


Qy 


121 


Db 


122 


Qy 


181 


Db 


182 


Qy 


241 


Db 


242 


Qy 


301 


Db 


302 



GATCTGGTACCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGC 6 0 

MM I llllll IM I llllll M.lll II II illll ll'llll ,1' I II M II 1 IMIM 

GATCTGGTACCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGC 6 1 
TGTGGCTGCTAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCA 12 0 

I M 1 1 1 II II I III II II II I Ml II 1 1 II I M II MM MM II 1 1 1 1 1 1 II I M 1 1 1 

TGTGGCTGCTAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCA 121 
TCGACATGGAGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCA 180 

! I [ 1 1 ! 1 1 1 ; 1 1 1 1 1 1 1 1 1 1 1 1 1 ' 1 1 1 1 1 1 1 1 1 1 1 1 1 > 1 1 1 1 1 1 : 1 1 i ! 1 1 1 1 1 : 1 

TCGACATGGAGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCA 181 
AGCTTCGGCTTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGG 24 0 

1 1 1 ) 1 1 1 1 1 1 Ml I III IIIIIMM II II Mill MIIMMM I II M II: Illll I 

AGCTTCGGCTCGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGG 241 
CAGTACTGGCTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGG 3 00 

I I 111 IN IIIIIMM: Ill I ! 1 1 1 1 1 1 1 1 1 ! MINI 1 1 

CCGTACTGGCTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGG 301 
AGCCCGAGCCAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAA 360 

MMIMMMMMMMMMMMIIMM II MMMMIMIMMMIMM 

AGCCCGAGCCAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAA 361 



Qy 


361 


Db 


362 


Qy 


421 


Db 


422 


Qy 


481 


Db 


482 


Qy 


541 


Db 


542 


Qy 


601 


Db 


602 


Qy 


661 


Db 


662 


Qy 


721 


Db 


722 


Qy 


781 


Db 


782 


Qy 


841 


Db 


842 


Qy 


901 


Db 


863 


Qy 


961 


Db 


923 


Qy 


1021 


Db 


983 


Qy 


1081 


Db 


1043 


Qy 


1141 


Db 


1103 



GCGGCAACCAAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCA 4 2 0 

MIMIMIIIIIIIMIIIIMIIIIIIIIIMIIIIIIIMMIIIIIIIIIIIIIII 

GCGGCAACCAAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCA 421 

ACACGTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGC 4 8 0 

MIIIIIIIIIIIIIIIIIIIIIIIIIMIIIMIIIIIIIIIMIIIMIIIIIIIIII 

ACACGTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGC 4 81 

GCCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCA 54 0 

1 1 1 II 1 1 1 1 ! 1 1 1 1 1 . 1 1 1 1 1 1 1 1 1 1 111111:1111 1 1 ! I ! 1 1 1 1 I ! ! I 1 1 1 1 

G C CTG CTGAGG CT CAAGTT AAAAGTGGAGCAG CACGTGG AG CT ATA C CAGAAAT A CAG CA 541 



IMMMIMM MM I MIM MM MIMM IMM Illlllllll llllll II! 



TGTCCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGG 660 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 L 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TGTCCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGG 661 



I ! 1 1 . 1 1 1 1 II 1 1 1 ' 1 , 1 1 MMI Mill M Mill llllll MM llllll III 



TTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGC 7 8 0 

Ml I II M M MM I! 1 1 M II 1 1 1 1 1 1 1 1 1 1 1 M M Ml II 1 1 1 1 II 1 1 1 M Ml Ml 

TTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGC 781 
CCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGC 84 0 

I! I II I M I M I M 1 1 M I II 1 1 1 1 M I II 1 1 II I II II i II II 1 1 1 1 1 1 1 MM M I 

CCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGC 841 

ACCGCCGAGCCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCA 90 0 

MMMM Illllllllllll 
ACCGCCGA GCCCTGGATACCA 862 

ACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCC 960 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 ; I . : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

ACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCC 922 
GGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCC 1020 

II I Ml M II M I M II I M II 1 1 1 1 M I II 1 1 II I II M M II I M 1 1 M Ml MM 1 1 

GGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCC 982 
TGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGT 108 0 

1 1 1 1 1 1 !1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 1 [ I [ 1 1 1 1 ( 1 1 1 II 1 1 1 1 1 1 1 1 1 1 [ I [ 1 1 1 1 

TGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGT 1042 
ACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGC 114 0 

1 1 1 II 1 1 II 1 1 1 II 1 1 1 II 1 1 1 1 1 II 1 1 1 1 II I II M M II M 1 1 1 II Ml II II 1 1 

A CAA C CAG CA CAA CCCGGGCGCGTCGGCGGCGC CG TG CTG CGTG C CG CAGG CG CTGG AG C 1102 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : i : 1 1 1 1 1 1 1 1 1 1 1 1 1 ; 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ' 1 1 1 1 



Qy 
Db 

Qy 

Db 

Qy 
Db 



1201 TCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGC 12 60 

I I , 1 1 ' 1 1 1 1 1 1 1 ' , i . , ' I I , 1 1 1 I M 1 1 1 .1 1 1 1 II 1 1 1 II I , M 1 1 ! 1 1 1 1 

1163 TCGTGCGTTCCTGCAAGTGCAGCTGA - GCCCCGCCCCGCCCACAGCCCCGCCCACCCGGC 1221 
1261 AGGCCCGGCCCCACCCCCGCCCGCCTCACCGGGGCTGTATTTAAGGACATCGTGCCCCAA 1320 

, 1 1 J 1 1 1 1 1 ! 1 1 1 II 1 1 1 II 1 II II 1 1 1 1 1 M I ll M 1 . 1 ,1 1 1 1 II 1 1 1 M 1 1 1 II 

1222 AGGCCCGGCCCCACCCCCGCCCGCCTCACCGGGGCTGTATTTAAGGACATCGTGCCCCAA 12 81 

1321 GCCCACTTGGGATCGATTAAAGCGGCCGCGA 13 51 

III, I I I I M I I II I I I II I | I II 

12 82 GCCCAC - TGGGATCGATTAAAGGTGGAGAGA 1311 



RESULT 3 

GGTGFB1 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 



FEATURES 

source 



gene 
5 *UTR 
CDS 



Euteleostomi ; 
Sus . 

, A . B . 



(1988) 



B . , National Institute of 
Laboratory of Chemoprevention, 



GGTGFB1 1750 bp mRNA linear MAM 27-MAR-1996 

Porcine mRNA for transforming growth factor-beta 1. 

X12373 

X12373.1 GI:63808 

transforming growth factor-beta 1. 

Sus scrofa (pig) 

Sus scrofa 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Mammalia; Eutheria; Cetartiodactyla ; Suina; Suidae; 

1 (bases 1 to 1750) 

Jakowlew, S . B . , Dillard, P . J . , Sporn,M.B. and Roberts 
Nucleotide sequence of chicken transforming growth factor-beta 1 
(TGF-beta 1) 

Nucleic Acids Res. 16 (17), 8730 
88335639 
3166520 

2 (bases 1 to 1750) 
Jakowlew, S . B . 
Direct Submission 

Submitted (14 -JUL-1988) Jakowlew S 
health, National Cancer Institute, 

Building 41, Room B902, Bethesda, Maryland 20892, USA 
The submitters believe that the chicken cDNA library was 
contaminated with porcine cDNA, and that the sequence is infact 
porcine TGF-beta-1. 27-MAR-1996. 

Location/Qualifiers 

1. .1750 

/organism="Sus scrofa" 
/ mo 1 _ t yp e = " mRNA " 
/ strain="white leghorn" 
/db_xref="t axon: 9823" 
/clone="pTGFB-ChX119" 
/eel l_type= " chondrocyte " 
1. .1750 

/gene="TGF-beta 1" 
1. .446 

/gene= "TGF-beta 1" 
447. .1622 
/gene= "TGF-beta 1" 
/ codon_start=l 

/product=" transforming growth factor" 
/protein_id="CAA30933 .1" 



/db_xref="GI : 63809" 
/db_xref ="SWISS-PROT: P07200" 

/translation="MPPSGPGLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRK 
RIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEAD 
YYAKEVTRVLMVESGNQI YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNDSWGYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFR 
LSAHCSCDSKDNTLHVEINAGFNSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSR 
HRRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLD 
TQYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS " 
mat_peptide 1285. .1619 

/gene="TGF-beta 1" 
3'UTR 1620. .1750 

/gene= H TGF-beta 1" 

BASE COUNT 325 a 627 c 479 g 319 t 

ORIGIN 



Query Match 89.9%; Score 1217; DB 4; Length 1750; 

Best Local Similarity 95.8%; Pred . No. 4.2e-205; 

Matches 1288; Conservative 0; Mismatches 15; Indels 42; Gaps 2; 



Qy 


10 


CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 


69 


Db 


442 


II III MINIMUM II II 1 1 1 : 1 Ml II i Ml 1 II IM 1 Ml 1 1 MM .1 

CCCCCATGCCGCCTTCGGGGCCTGGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 


501 


Qy 


70 


TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 

M II Mill Mill III Mill II II III MIMII III 1 1 1 1 1 1 1 1 1 1 1 1 III Mill 


129 






JDl 


Qy 


130 


AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 

AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 


189 


Db 


562 


621 


Qy 


190 


TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 

1 1 1 1 1 1 1 1 1 1 1 1 11 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 IMMM 

TCGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCCGTACTGG 


249 


Db 


622 


681 


Qy 


250 


CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 

MMMMMMMMMMIIMM IIIIMIIMIIIIIIIIIIIIIIIMMIIMI 

CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 


309 


Db 


682 


741 


Qy 


310 


C^GAGGCGGACTACTACGC(^GGAGGT(^CCCGCGTGCTAATGGTGGAAAGCGGCAACC 


369 


Db 


742 


1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 

CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 


801 


Qy 


370 


AAATCTATGATAAATTCAAGGGCACCCCCCAC^GCTTATATATGCTGTTCAACACGTCGG 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1! 1 1 1 1 

AAATCTATGATAAATTC^GGGCACCCCCC^CAGCTTATATATGCTGTTCAACACGTCGG 


429 


Db 


802 


861 


Qy 


430 


AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 


489 


Db 


862 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 f 1 1 1 1 1 1 1 1 

AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 


921 


Qy 


490 


GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 

1 1 1 M M II II 1 1 1 1 1 III II 1 1 1 1 1 1 1 1 1 1 1 1 1 1! II II M 1 1 1 II 1 II 1 1 1 1 II 1 1 

GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 


549 


Db 


922 


981 



Qy 



550 



GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 



609 



Db 


982 


Qy 


610 


Db 


1042 


Qy 


670 


Db 


1102 


Qy 


727 


Db 


1162 


Qy 


787 


Db 


1222 


Qy 


847 


Db 


1282 


Qy 


907 


Db 


1303 


Qy 


967 


Db 


1363 


Qy 


1027 


Db 


1423 


Qy 


1087 


Db 


1483 


Qy 


1147 


Db 


1543 


Qy 


1207 


Db 


1603 


Qy 


1267 


Db 


1663 


Qy 


1327 


Db 


1723 



II II ! II , II I II 1 1 . II MM I M M II I II I M , II 1 1 1 1 IT IM III I; II : II 

GGGGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 1041 
ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 669 

IMMMMMMMMMMMMMMMMMMMMMMMMMMMMMI 

ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 1101 

TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAAC- - -G 726 

I I I 1 I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I 1 I I 1 I 1 I I I I I I I I I I 1 I I I 
TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGCAG 1161 

GGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCG 786 

MMMMMMMMMMMMMMMMMMMMMMIMMMM MIMM 

GGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCC 1221 
TGCTCCTCATGGCCACGCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCC 84 6 

MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 

TGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCC 1281 
GAGCCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACT 906 

I I MMMMMMMMMI 

GA GCCCTGGATACCAAGTACT 13 02 

GCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGG 966 

MMMMMIMMMMMMMMMMMMMMMMMMMMM MMMI 

GCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGG 1362 
ACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGC 102 6 

I I M II II I Ml II I II M I Ml III III M M 1 1 1 1 MM 1 1 M II 1 1 II I M III III 

ACCTGGGCTGGAAGTGGATTCATGAAGCCAAGGGCTACCATGCCAATTTCTGCCTGGGGC 1422 
CCTGTCCCTACATCTGGAGCCTAGACA.CTCAGTACAGCAAGGTCCTGGCTCTGTACAACC 1086 

MMMI II Ml III II III III Ml MMI MM III M II I MM II MM MM III 

CCTGTGCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACC 14 82 
AGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCGGCAGGCGCTGGAGCCACTGC 114 6 

III MM II Ml III M I II III III MIMIMI II I II M I M MM III I Ml I III 

AGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGC 1542 
CCATGGTGTACTACGTGGGCCGCAAGCGCAAGGTGGAGCAGCTGTGCAACATGATCGTGC 12 06 

III ilM II Ml III M I II MM II III M II II IM IM 1 1 M III MM III MM 

CCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGC 1602 
GTTCGTGCAAGTGCAGCTGAGGCCCCGGCCCGCCCACAGCCCCGGCCACCCGGCAGGCCC 12 66 

MMMMMIMMMMMMMMMMMMMMMMMMMMM MMMI 

GTTCCTGCAAGTGCAGCTGAGGGCCCGCCCCGCCCACAGGCCCGCCCACCCGGCAGGCCC 1662 
GGCCCCACCCCCGCCCGCCTCACCGGGGCTGTATTTAAGGACATCGTGCCGCAAGCCCAC 132 6 

M I Ml 1 1 1 1 II 1 1 II 1 1 1 M M II 1 1 1 1 1 M M 1 1 II I II I III II Ml M II I ! 1 1 

GGCCCCAGGCCCGCCCGCCTCACCGGGGCTGTATTTAAGGACATCGTGCCCCAAGCCCAC 1722 



1 1 1 1 1 M I 1 1 1 1 1 1 i I I M 



RESULT 



4 



SSTGFBR 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
MEDLINE 
PUBMED 
FEATURES 

source 



CDS 



1605 bp mRNA linear 
mRNA for transforming growth factor-beta 



MAM 27-MAR-1995 
TFG) precursor. 



mat_peptide 



SSTGFBR 
Porcine 
Y00111 

Y00111.1 GI:2129 
transforming growth factor-beta. 
Sus scrofa (pig) 
Sus scrofa 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Cetartiodactyla ; Suina; Suidae; Sus. 
1 (bases 1 to 1605) 
Derynck,R. and Rhee,L. 

Sequence of the porcine transforming growth factor-beta precursor 

Nucleic Acids Res. 15 (7), 3187 (1987) 

87174844 

3470708 

Locat ion/ Qual i f iers 
1. .1605 

/organism= "Sus scrofa" 
/mol_type=" mRNA 11 
/db_xref="taxon: 9823" 
/tissue_type=" ova r y " 
/clone_lib=" lambda gtlO" 
404. .1576 

/note="TFG-beta precursor (AA 1-390)" 

/codon_start=l 

/protein_id="CAA68291 .1" 

/db_xref="GI :2130" 

/db_xref ="SWISS-PROT: P07200" 

/translations "MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRK 
RIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEAD 
YYAKEVTRVLMLESGNQIYDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFR 
LSAHCSCDSKDNTLHVEINGFNSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRH 
RRALDTNYCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS " 
1238. .1573 



misc feature 



misc feature 



BASE COUNT 
ORIGIN 



/product= "mature 
647. .655 
/note="pot . 
809. .817 
/note="pot . 
298 a 572 c 



TFG-beta" 



N-glycosylation site" 



N-glycosylat ion site" 
437 g 298 t 



Query Match 84 . 8%; 

Best Local Similarity 96.3%; 
Matches 1200; Conservative 



Score 1146.8; DB 4; 
Pred. No. l.le-192; 
0; Mismatches 7; 



Length 16 05; 
Indels 39; Gaps 



l; 



Qy 

Db 



1 0 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 6 9 

ll III I ! 1 1 1 1 ! t ' l I ! IM 1 1 1 : l [ 1 1 1 1 1 1 1 1 1 1 : l [ 1 1 1 

3 99 CCCCCATGCCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 4 58 



Qy 



Db 



70 



459 



T AGTG CTG A CG CCTGG C CGG C CGG C CG C CGGA CTGT C CAC CTG CAAG A CCAT CG A CATGG 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIII 

TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 



129 



518 



Qy 


130 


Db 


519 


Qy 


190 


Db 


579 


Qy 


250 


Db 


639 


Qy 


310 


Db 


699 


Qy 


370 


Db 


759 


Qy 


430 


Db 


819 


Qy 


490 


Db 


879 


Qy 


550 


Db 


939 


Qy 


610 


Db 


999 


Qy 


670 


Db 


1059 


Qy 


730 


Db 


1119 


Qy 


790 


Db 


1179 


Qy 


850 


Db 


1238 


Qy 


910 


Db 


1260 



AG CTGGTGAAG CGGAAG CGCATCGAGG C CATT CG CGG C CAGATTCTGTCCAAG CTTCGG C 18 9 

1 1 , 1 1 M 1 1 1 1 1 1 1 i 1 1 1 1 1 ! I i 1 1 Ml 1 1 i 1 1 1 1 II 1 1 1 1 1 1 1 1 M I II 1 1 M I M 

AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 578 
TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 9 

I 1 1 1 1 M ' 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 : 1 1 1 MM 1 1 1 1 1 1 1 1 1 1 1 1 MMMI 

TCGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCCGTACTGG 63 8 
CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 0 9 
CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 698 
CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 369 

MMMMMMMMMMMMMMMMMMIMMMI 1 1 1 1 M 1 1 1 1 1 1 1 1 1 

CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGCTGGAAAGCGGCAACC 758 
AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 42 9 

I M . 1 1 1 1 M ! 1 1 1 1 It 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 M 1 II M 1 1 1 M 1 1 1 M 1 1 1 1 

AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 818 
AGGTGCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 8 9 

1 1 1 1 M 1 1 1 1 1 II 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M II II 1 1 Ml 1 1 1 M M 1 1 1 1 II 

AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 8 78 
GGCTC^AGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTGCT 54 9 

1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 M M I M II 1 1 II 1 1 1 1 1 II 1 1 1 1 1 M 1 1 1 1 II 

GG CT CAAGTTAAAAGTGG AG CAG CA CGTGG AG CTATA C CAG AAATA CAG CAATG ATT C CT 93 8 
GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 609 

MMMM MMMMIMMMMIIMM MM MMMMMMMMIIMM 

GGCGCTAGCTCAGCAACCGGCTGCTGGCCCGCAGTGACTCACCGGAGTGGCTGTCCTTTG 998 
ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 669 

II 1 1 1 1 M 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 II 1 1 M II II II Ml 1 1 M 1 1 1 1 M I M 

ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 1058 
TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 729 

II 1 1 1 M 1 1 1 1 1 II 1 1 1 1 1 II 1 1 M 1 1 1 1 1 1 1 1 Ml 1 1 II I II M 1 1 1 M 1 1 1 Ml I II 

TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 1118 
TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 78 9 

1 1 M II 1 1 II I II 1 1 II I II M M 1 1 1 III 1 1 1 II I III 1 1 1 1 1 II M I MM 1 1 1 1 I 

TCAATTCTGGGCGGCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 117 8 
TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 9 

1 1 II II 1 1 II I II I II 1 1 1 1 M M 1 1 II II M II I MM I M I MM 1 1 M M I M 1 1 

TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGA - 1237 
CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 909 

MMMMMMMMMIMI 

GCCCTGGATACCAACTACTGCT 12 5 9 

TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 96 9 

1 1 1 1 M M 1 1 1 Ml 1 1 1 1 1 1 1 1 1 M 1 1 Ml 1 1 1 II I III 1 1 1 1 1 1 1 1 M MM I II I II 

TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 1319 



Qy 



97 0 TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 102 9 



1 1! I M I II I II I M I II M III 1 1 1 1 1 M III M I II II 1 1 1 1 Ml 1 1 1 1 Ml 1 1 II 

Db 132 0 TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 137 9 

Qy 1030 GTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 108 9 

1 1 Ml M 1 1 1 1 1 1 1 1 1 1 II Ml 1 1 1 1 1 II II I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 13 80 GTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 143 9 

Qy 10 90 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 9 

11 lllllll Ml IIMIMM II Mill MIM I MM Mill MM 

Db 14 4 0 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 14 99 

Qy 115 0 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 12 0 9 

Mill M IMIMIMM IIIIIIIIIIIMIMMIIM II II IIMIMM 

Db 1500 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 155 9 

Qy 1210 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCAC 12 55 

1 1 : 1 1 : 1 1 1 1 : 1 1 ! ] 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 j 1 1 1 1 1 1 1 i 

Db 1560 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCAC 1605 



RESULT 5 
AF461808 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



gene 
CDS 



AF461808 2221 bp mRNA linear MAM 03-JAN-2002 

Sus scrofa transforming growth factor beta 1 (TGFB1) mRNA, complete 
cds . 

AF461808 

AF461808 .1 GI : 18042250 

Sus scrofa (pig) 
Sus scrofa 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Cetart iodactyla ; Suina; Suidae; Sus. 

1 (bases 1 to 2221) 

Wimmers,K., Chomdej,S., Ponsuksil i , S . and Schellander, K. 
Polymorphism in the porcine transforming growth factor beta 1 gene 
Unpublished 

2 (bases 1 to 2221) 

Wimmers,K., Chomde j , S . , Ponsuksil i , S . and Schellander , K. 
Direct Submission 

Submitted (20-DEC-2001 ) Institute of Animal Breeding Science, 
University of Bonn, Endenicher Allee 15, Bonn 53115, Germany 

Loca t ion/ Qua 1 i f iers 

1. .2221 

/organism="Sus scrofa" 

/mol_type="mRNA" 

/db_xref ="taxon: 9823" 

/chromosome^ "6" 

1. .2221 

/gene= M TGFBl" 

1. .1173 

/gene= " TGFB1 " 

/not e= " cytokine " 

/codon_start=l 

/product =" transforming growth factor beta 1" 
/protein_id="AAL57902 .1" 
/db_xref ="GI : 18042251" 

/ translat ion= "MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRK 



3 ' UTR 

BASE COUNT 
ORIGIN 



RIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEAD 
YYAKEVTRVLMVESGNQI YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFR 
LSAHCSCDSKDNTLHVEINGFNSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
Q YSKVLAL YNQHN PGAS AAP CCVPQALE PLP I VYYVGRKPKVEQLSNM I VRS CKCS 11 
1174. .2221 
/ gene= "TGFB1 11 
504 a 616 c 649 g 452 t 



Query Match 83 . 2%; 

Best Local Similarity 96.3%; 
Matches 1178; Conservative 



Score 1125.4; DB 4; 
Pred. No. 6.1e-189; 
0; Mismatches 6; 



Length 2221; 
Indels 39; Gaps 



1; 



Qy 


15 


Db 


1 


Qy 


75 


Db 


61 


Qy 


135 


Db 


121 


Qy 


195 


Db 


181 


Qy 


255 


Db 


241 


Qy 


315 


Db 


301 


Qy 


375 


Db 


361 


Qy 


435 


Db 


421 


Qy 


495 


Db 


481 


Qy 


555 


Db 


541 


Qy 


615 



ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 74 

III 1 1 1 1 1 1 1 1 ! 1 1 1 II 1 1 1 1 II M 1 1 1 1 1 Tl I M I II 1 1 1 1 1 II 1 1 1 1 1 1 II ll 1 1 

ATGCCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 
CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 134 

IIIIMIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 12 0 
GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 194 

! 1 1 i 1 1 1 , I i - 1 1 1 1 1 1 1 1 Ml 

GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTCGCC 18 0 
AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 254 

I! 1 1 1 i 1 1 1 1 1 1 II I II : II I II 1 1 1 M ! 1 1 1 1 1 II Ml 1 1 M 1 1 llllllllllll 

AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCCGTACTGGCTCTT 24 0 
TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 314 

I II : 1 1 1 1 1 , 1 1 '1 1 1 II 1 1 1 ! II M 1 1 1 1 1 1 1 1 1 II I M M 1 1 1 II ' 1 1 1 1 h i II 1 1 1 

TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 3 00 
GCGGACTACTACGCC^GGAGGTC^CCCGCGTGCTAATGGTGGAAAGCGGC^CCAAATC 3 74 

IIIIMMIIIMI! II IMIIIII IMMMIMMMMMMM Mill IM 

GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 3 6 0 
TATGATAAATTCAAGGGC^CCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 434 

I II M 1 1 1 1 M Ml 1 1 II I II MM 1 1 1 1 1 1 1 1 II Ml II I M I MM II M I M II II I 

TATGATAAATTC^AGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 4 2 0 
CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 4 94 

I II M 1 1 1 1 M 1 1 II I II 1 1 1 1 II M 1 1 1 1 M II I M Ml I M I Ml M 1 1 II M II I II 

CGGGAAG CGGTG C CGGAAC CTGTATTGCT CTCTCGGG CAGAG CTG CG CCTG CTGAGG CT C 4 8 0 



I II M I 1 1 1 1 1 M I Ml M I M 1 1 1 1 I M 1 1 II II I III II 1 1 III 1 1 1 M MM M 



TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 614 

I III 1 1 1 1 1 M I M 1 1 II I II II II M I M I MM II I II I M 1 1 1 1 1 1 1 1 1 III II 

TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 6 0 0 
ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 674 

I M 1 1 1 II 1 1 1 1 1 II I III II I II 1 1 1 1 1 M M I Mill 1 1 II I II 1 1 ll 1 1 1 MMM 



Db 



601 



ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 



660 



Qy 


675 


Db 


661 


Qy 


735 


Db 


721 


Qy 


795 


Db 


781 


Qy 


855 


Db 


835 


Qy 


915 


Db 


862 


Qy 


975 


Db 


922 


Qy 


1035 


Db 


982 


Qy 


1095 


Db 


1042 


Qy 


1155 


Db 


1102 


Qy 


1215 


Db 


1162 



675 GCCCACTGTTCCTGTGACAGC^AAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 734 

I M I II I M II II I II I M 1 1 1 II II II I M I M I II II II 1 1 1 II 1 1 II I II I M II II 

661 GCCC^CTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 72 0 



TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGGTCCTC 7 94 

IIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 78 0 

ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 854 

IIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIII 

ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGA 834 



855 GATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGC 914 

I II II 1 1 II I M II I II 1 1 1 1 II II 1 1 

83 5 GCCCTGGATACCAACTACTGCTTCAGC 861 



TCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGC 

I IMIillllllllllllll MINI IlihIIIIIIIM llll'lllllllllll 

TCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGC 



974 



921 



TGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCC 1034 

1 1 1 1 1 1 I III 1 1 M M 1 1 1 1 1 1 1 II Ml I I 1 1 1 II 1 1 1 1 1 1 1 1 1 II! 1 1 , 1 1 1 1 1 1 

TGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCC 981 
TACATCTGGAGCGTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAAC 1094 

INN MINI Ml MM Ml Ml MINI IN MM Mill Ml MINI Ml MM II 

TACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAAC 1041 
CCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTG 1154 

I M 1 1 M 1 1 II II i I II 1 1 1 1 , 1 1 1 II I I III 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 Ml I II 

CCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTG 1101 
TACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATGGTGCGTTCCTGC 1214 

1 1 1 1 1 M I III 1 1 1 1 M M I M 1 1 1 II I M M 1 , 1 1 1 II 1 1 1 1 1 1 1 I II 1 1 1 1 M I II 

TACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGC 1161 
AAGTGCAGCTGAGGCCCCGCCCC 123 7 

1 1 1 1 1 ' I III 1 1 1 1 1, 1 M 

AAG TG CAG CTG AGG C C C CAAG C C 118 4 



RESULT 6 

DOGTGFB1A 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



linear MAM 30-OCT-1994 
(TGFB1) mRNA, complete 



DOGTGFB1A 13 69 bp mRNA 

Canine transforming growth factor-beta 1 
cds . 
L34956 

L34956 .1 GI : 516071 

homologue; transforming growth factor-beta 1. 
Canis familiaris (dog) 
Canis familiaris 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Mammalia; Eutheria; Carnivora; Fissipedia; Canidae; 
1 (bases 1 to 1369) 

Manning, A.M. , Auchampach, J . A. , Drong,R.F. and Slightom,J 



Euteleostomi ; 
Canis . 



TITLE 

JOURNAL 
COMMENT 

FEATURES 

source 



gene 



5 ' UTR 



CDS 



mat_peptide 



3 'UTR 



BASE COUNT 
ORIGIN 



Cloning of a canine cDNA homologous to human transforming growth 
factor-beta 1 (TGFbetal) 
Unpublished (1994) 

Original source text: Canis familiaris adult jugular vein 
endothelial cDNA to mRNA. 

Location/Qualifiers 

1. .1369 

/organism^ "Canis f amil laris " 

/mol_type=" mRNA" 

/db_xref ="taxon: 9615" 

/eel l_type=" LPS -activated" 

/tissue_type=" jugular vein endothelial" 

/dev_stage= " adult " 

1. .1369 

/gene="TGFBl" 

1. .57 

/gene="TGFBl" 
58. .1230 
/gene=" TGFB1" 

/ funct ion=" ant i- inflammatory agent" 

/not e= "precursor" 

/codon_start=l 

/product=" transforming growth factor-beta 1" 
/protein_id="AAA51458 .1" 
/db_xref="GI : 516072" 

/translation= " MPPSGLRLLPLLLPLLRLLVLTPGRPAAGLSTCKTI DMELVKRK 
RIEAIRGQILSKLRLSSPPSQGEVPPVPLPEAVLALYNSTRDRVAGESAEPEPEPEAD 
YYAKEVTRVLMVENTNKI YEKVKKS PHS I YMLFNTSELREAVPEPVLLSRAELRLLRL 
KLKAEQHVELYQKYSNDSWRYLSNRLLAPSDTPEWLSFDVTGWRQWLSHGGEVEGFR 
LSAHCSCDSKDNTLQVDINGFSSSRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRQ 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNM I VRSCKCS " 
1039. .1227 
/gene="TGFBl" 

/product=" transforming growth factor-beta 1" 
1231. .1369 
/gene="TGFBl" 
264 a 473 c 415 g 216 t 1 others 



Query Match 74.3%; Score 1005.8; DB 4; 

Best Local Similarity 87.5%; Pred . No. 8.5e-168; 
Matches 1150; Conservative 0; Mismatches 123; Indels 



Length 13 69; 

41; Gaps 



3; 



Qy 

Db 



1 0 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 6 9 

II 111 1 1 1 1 II MM II II llllllllllilllllillllll 1 1 1 1 I 

53 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTGCCGCTGCTGCGGCTAC 112 



Qy 

Db 

Qy 
Db 



7 0 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 9 

1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 M I M I M 1 1 1 1 1 1 1 1 1 1 1 1 i I ] 1 1 1 1 1 M 1 1 1 1 1 1 1 M II 

113 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 172 

13 0 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 9 

I IIIIIIIIIIIIIIIIIIIIIMIIIIII lllllllllll lllllllllll I I I I 

173 AACTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 232 



Qy 



190 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 9 



I 1 1 1 1 1 1 1 1 1 1 1 1 i i 1 1 1 1 1 i I ! Illlllllll llllllll Mill II MM 

233 TCTCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGTGCCGCTGCCCGAGGCCGTGCTGG 



Db 


233 


T CT C GAG C C C C C CG AG C CAGGGGG AGGTG C CG C C CGTG CCG CTG C C CG AGG C CGTG CTGG 


292 


Qy 


250 


CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 

1 II llllllll IIIIIIIMIIIII II Mill II 1 III lllllllllllll 
C C CTCTACAA CAG CA CC CG CG A C CGGG TGG CGGGGGAGAG CG C CGAG C CGG AG C CCGAG C 


309 


Db 


293 


352 


Qy 
Db 


310 
353 


CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 

1 Ml III II 1 IMMM III 1 II II 1 II IMMM IM 1 II MM M 1 1 MM 

CCGAGGCGGACTACTACGCO^GGAGGTCACCCGCGTGCTAATGGTGGAAAAO^CCAACA 


369 
412 


Qy 


370 


AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 

Illlllllll III Mill 1 II MIMI Illlllllll llllllll II 1 

AAATCTATGAGAAAGT CAAGAAAAGT CCG CACAG CATATATATG CTCTT CAA CA CAT CAG 


429 


Db 


413 


472 


Qy 
Db 


430 
473 


AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 

IMMM Mill Mill II II II MMIIII IMIIIIII IMIMMMM 

AGCTCCGAGAAGCAGTGCCTGAGCCCGTCTTGCTCTCCCGGGCAGAGTTGCGCCTGCTGA 


489 
532 


Qy 


490 


GGCTC^GTTAAAAGTGGAG(^GCACGTGGAGCTATAC(^GAAATAC^GCAATGATTCCT 

MMI llllllll IMIIIIII MMIIII IIMIMIMI lllllllllllll 

GGCTCAAGTTAAAAGCGGAGCAGCATGTGGAGCTGTACCAGAAATATAGCAATGATTCCT 


549 


Db 


533 


592 


Qy 


550 


GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 

1 1 1 1 1 M 1 1 1 1 II 1 II M 1 1 1 1 1 1 1 1 Mill III 1 II II MIMMIIMM 

GGCGCTACCTCAGCAACCGGCTGCTGGCGCCCAGCGACACGCCAGAATGGCTGTCCTTTG 


609 


Db 


593 


652 


Qy 


610 


ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 

MIMI Mill Ml IMIMMMM II III II 1 Mill IMMM 

ATGTCACTGGAGTCGTGAGGCAGTGGCTGAGCCATGGAGGGGAAGTCGAGGGCTTTCGCC 


669 


Db 


653 


712 


Qy 


670 


TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 

ill MMIMMIMM II IIIIIIMIMMMMI II M II IIIIMIIII 

TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACA^ 


729 


Db 


713 


772 


Qy 


730 


TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 

III III IMMM 1 1 1 1 1 II II 1 1 1 M 1 II M lllllllllll MIIIIMM 

TCAGTTCCAGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGACCCTTCCTGC 


789 


Db 


773 


832 


Qy 


790 


TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 

1 1 1 II 1 1 1 1 III 1 1 1 II 1 1 1 1 M 1 M 1 Ml 1 Ml IM 1 II 1 II 1 1 1 M 1 Mill 

TC CTCATGGCCACCCCACTGGAGAGGGCC CAG CAC CTG CACAG CTCCCGG CAG CG CCG - - 


849 


Db 


833 


890 


Qy 


850 


CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 

IMIIIIII MIMMIIMM 

GGCCCTGGACACCAACTACTGCT 


909 


Db 


891 


913 


Qy 


910 


TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 

1 1 1 1 1 1 II M 1 1 II 1 1 1 II 1 II 1 1 II 1 1 MMMMMIMIIIIMIMI Mill 1 
TCAGCTCCACGGAGAAGAACTGCTGCGTCCGGCAGCTCTACATTGACTTCCGCAAGGATC 


969 


Db 


914 


973 


Qy 


970 


TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 


1029 


Db 


974 


1 1 M 1 M 1 1 II M 1 M Mill llllllll Mill II II 1 1 II 1 1 1 1 II IM 1 1 

TGGGCTGGAAGTGGATCCATGAGCCCAAGGGTTACCACGCTAACTTCTGCCTGGGGCCCT 


1033 


Qy 


1030 


GTCCCTAC^TCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTAC^ACCAGC 


1089 



I llllllll llllllll Mill IM MM I MIMI I M I M MIMMIIMM 



Db 



1034 GCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGC 10 93 



Qy 1090 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 9 

II III llllll IMMMMIMI MMII MMMMIMIMIMMMIMI 

Db 10 94 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 1153 

Qy 1150 TCGTGTACTACGTGGGCCGCAAGCCGAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 1209 

II MIMIMII llllll MMMMIMIMMMMM I IIMMIIIIII I 

Db 1154 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCGAACATGATCGTGCGCT 1213 

Qy 1210 CCTGCAAGTGCAGCTGAGGCCCCGCCCCG-CCCACAGCCCCGCCCACCCGGCAGGCCCGG 1268 

II III MMII llllll IMIIMI II III III 1 1 I llllll 1 1 1 1 

Db 1214 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGTCCGGCAGGCCCCGCCCACCGGCAGGNCCGG 1273 

Qy 12 6 9 CCCCACCCCCGCCCGCCTCACCGGGGCTGTATTTAAGGACATCGTGCCCCAAGC 1322 

I 1 1 I IMIIIIIIII I II 1 1! 1 1 1 1 1 Ml 1 1 1 1 1 M llllll II 

Db 1274 CCCCGCCCCCGCCCGCTGCGCC-GGGCTGTATTTAAGGACACCGCGCCCCAGGC 132 6 



RESULT 7 

E00973 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



E00973 2527 bp RNA 

cDNA encoding human TGF-beta. 

E00973 

E00973 . 1 GI :2169234 
JP 1986219395-A/l . 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 2527) 
Riku, M . A. D . D . and Debit sudo , B . G . 
NUCLEIC ACID CODED WITH TGF-BETA AND ITS 
Patent: JP 1986219395-A 1 29-SEP-1986; 
GENENTECH INC 
OS 
PN 
PD 
PF 
PR 
PI 



linear PAT 29-SEP-1997 



Craniata ; Vertebrata ; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



USE 



human 

JP 1986219395-A/l 
29-SEP-1986 

20-MAR-1986 JP 1986064661 
22-MAR-1985 US 85 715142 

RIKU MAIKERU ANDORE DERINKU, DEBITSUDO BANNOOMAN GETSUDERU 
C12P2l/00,C12Nl/00,C12N5/00,C12N15/00//C12Ql/68, (C12P21/00, PC 
C12R1 : 91) , 



PC 



PC 

cc 
cc 
cc 
cc 
cc 

FH 
FT 
FT 
FT 
FT 
FT 
FT 



(C12Nl/00 / C12Rl : 19) , (C12N5/00 , C12R1 : 91 ) , (C12N15/00 , C12R1 : 91) ; 
strandedness : Double; 
topology: Linear; 
hypothetical: No; 
anti-sense: No; 

*source: t issue_type=placenta and glyoblastoma ; FH Key 
Locat ion/ Qua 1 if iers 



CDS 

mat_peptide 

5 'UTR 
3 'UTR 



842. .2014 

/product= 'pre TGF-beta' 
1676. .2011 
/product^ ' TGF-beta ' 
1. .841 
2015. .2537 



FT stem_loop 37. .113 

FT stem_loop 2015. .2100 

FT polyA_site 2514. .2519. 

FEATURES Locat ion/Qual i f iers 

source 1. .2527 

/organism= M Homo sapiens" 
/mol_type= "genomic RNA" 
/db_xref = n taxon: 96 06" 
BASE COUNT 472 a 888 c 735 g 432 t 

ORIGIN 

Query Match 73.5%; Score 994.8; DB 6; Length 2527; 

Best Local Similarity 85.9%; Pred. No. 6.9e-166; 

Matches 1144; Conservative 0; Mismatches 147; Indels 41; Gaps 2; 

CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 69 

II III 1 1 1 I II IIIMIIIIII M Mill III MMIIMIMMI I 

CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 8 96 
TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 9 

I 1 1 1 1 1 : 1 1 ! 1 1 1 1 1 1 1 MINIMI Mill Mill Ml M MIIMIMI 

TGGTGCTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 95 6 
AG CTGGTGAAG CGG AAG CG CAT CGAGG CCATTCG CGG C CAG ATT CTGT C CAAG CTT CGG C 18 9 

I M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 III I I II 1 1 1 IIIMIIIIII 1 1 ; 1 1 i 1 1 1 1 MM 

AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 1016 
TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 9 

I MMIM IM III M II II I Ml II M II 1 1 1 M I Mill II II I 

TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 1076 
CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 309 

I II MUM MMIIMIMMI MINIM MM 1 1 1 1 1 II II 1 1 1 II 1 1 

CCCTCTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 1136 
CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 3 6 9 

I Mill Ml III III IMIIMM II II I MMIM Mill! Ill I I MM 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 1196 
AAATCTATGATAAATTC^GGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 42 9 

1 1 1 1 1 1 1 : 1 ! II MUM I I MINI MM I MMIM II I 

AAATCTATGACAAGTTCAAGCAGAGTACACACAGC^ 12 56 

AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 8 9 

MMIM MIIIMI II Mill II 1 1 [ 1 1 } 1 1 MM MM MMIM 

AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 1316 
GGCT(^GTTAAAAGTGGAGC^GCACGTGGAGCTATACC^GAAATA(^GaVATGATTCCT 54 9 

MIMMMMM MMM M M Mill II 1 1 1 1 1 1 II 1 1 II 1 1 1 MMM 

GGCTC^GTTAAAAGTGGAGCAGCACGTGGAGCTGTACC^GAAATAC^GC^CAATTCCT 1376 
GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 6 09 

MM M 1 1 1 1 1 1 1 M I Ml II 1 1 1 Mill Mill II MMM I II MM 

GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 1436 
ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 669 

II 1 1 II II 1 1 1 II 1 1 1 II 1 1 1 II 1 1 1 III III III II II Mill IMIMI 



Qy 


10 


Db 


837 


Qy 


70 


Db 


897 


Qy 


130 


Db 


957 


Qy 


190 


Db 


1017 


Qy 


250 


Db 


1077 


Qy 


310 


Db 


1137 


Qy 


370 


Db 


1197 


Qy 


430 


Db 


1257 


Qy 


490 


Db 


1317 


Qy 


550 


Db 


1377 


Qy 


610 



Db 



1437 



ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 14 96 



Qy 


670 


Db 


1497 


Qy 


730 


Db 


1557 


Qy 


790 


Db 


1617 


Qy 


850 


Db 


1676 


Qy 


910 


Db 


1698 


Qy 


970 


Db 


1758 


Qy 


1030 


Db 


1818 


Qy 


1090 


Db 


1878 


Qy 


1150 


Db 


1938 


Qy 


1210 


Db 


1998 


Qy 


1270 


Db 


2056 


Qy 


1330 


Db 


2116 



TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 729 

I II llllllll lllllllllllll lllllllllll II INN II lllllll 

TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGT 1556 



I I I llllllll Mllllllllllllllllll llllllllllllll lllllll 



T CCTCATGG C CA CC C CG CTGGAGAGGGC CCAG CA C CTG CA CAG CTCC CGG CAC CG CCGAG 84 9 

I 1 1 1 1 ! I 1 1 1 Ml ! 1 1 1 1 1 1 1 1 Ml 1 1 1 1 1 1 1 Mill IIIIIIIIIIIIIIIIII 

TTCTCATGG C CACC C CGCTGGAGAGGG C CCAG CAT CTGCAAAG CTC C CGG CA C CG CCGA - 1675 



C CCTGG ATA C CAACGA CTA CAAGG ATGA CGACGA CAAGG C C CTGG ATAC CAACTACTG CT 

llllllll llllllll MM 



909 



1697 



969 



TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 

1 1 1 1 i 1 : 1 1 1 1 11 1 1 1 1 1 1 1 1 , 1 1 1 1 1 1 1 1 I Ml lllllllllllll lllllll 

TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACC 1757 
TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 102 9 

I 1 1 1 1 1 1 1 1 1 1 1 1 M II II I II 1 1 M II 1 1 1 1 1 II I II I llllllll lllllll 

TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCT 1817 
GTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 108 9 

I llllllll llllllll Mill Mllllllllllllllllll MMIMMIMI 

GCCCCTACATTTGGAGCCTGGACACGCA.GTACAGC^AGGTCCTGGCCCTGTACAACCAGC 1877 
ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 9 

I lllllllllll MMMMMMMMMIMMIMMMMMIMI lllllll 

ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCA 1937 
TCGTGTACTACGTGGGCCGC^GCCO^GGTGGAGCAGCTGTCC^ACATGATCGTGCGTT 1209 

M ! MM Mill II MM Ml I I 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II I 

TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGGAGCTGTCCAACATGATCGTGCGCT 1997 

CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 1269 

Illllllllllllllllll lllllllllll I lllllllll II III 
CCTGCAAGTGCAGCTGAGGTCCCGCCCCGCC - - CCGCCCCGCCCGGCCCCACCCCGCCCC 2 05 5 

CCCACCCCCGCCCGCCTCACCGGGGCTGTATTTAAGGACATCGTGCCCCAAGCCCACTTG 132 9 

II II I III I II MMIMMIMI llllllllllllllll II 

GCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCGTGCCCCAAGCCCACCTG 2115 



I II 



RESULT 8 

A06669 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 



A06669 2537 bp 

Synthetic mRNA for preTGF-Betal . 
A06669 

A06669. 1 GI :412940 



mRNA 



linear PAT 29-JUL-1993 



KEYWORDS 
SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



CDS 



BASE COUNT 
ORIGIN 



synthetic construct 
synthetic construct 
artificial sequences . 
1 (bases 1 to 2537) 

NUCLEIC ACID ENCODING TGF- beta 3 AND ITS USE 
Patent: WO 8912101-A 4 14-DEC-1989; 

Location/Qualifiers 

1. .2537 

/organism="synthetic construct" 

/mol_type=" mRNA " 

/db_xref="t axon: 32 63 0" 

842. .2014 

/ codon_start=l 

/trans l_table=ll 

/product="preTGF-betal" 

/protein_id="CAA00588 .1" 

/db_xref ="GI : 412 941" 

/translati on = " M P PSGLRLLPLLLPLLWLLVLTPG P PAAGLSTCKTI DMEL VKRK 
RIEAIRGQILSKLRLASPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEAD 
YYAKEWRVLMVETHNEI YDKFKQSTHSI YMFFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFR 
LSAHCSCDSRDNTLQVDINGFTTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS" 
473 a 893 c 739 g 432 t 



Query Match 73 . 5%; 

Best Local Similarity 85.7%; 
Matches 1148; Conservative 



Score 994; DB 6; Length 2 537; 
Pred. No. 9.6e-166; 
0; Mismatches 145; Indels 47; 



Gaps 



2; 



Qy 



Db 



1 0 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 6 9 

II III I I I I II 1111:111 MINN Mill lllllllll III I 
837 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 8 96 



Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 



7 0 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 9 

I 1 1 . 1 1 1 1 1 1 ! 1 1 1 1 1 lllllllll Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 

8 97 TGGTGCTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 956 



130 



189 



AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IIIIMMIII MM 

957 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 1016 



190 



249 



TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 

I I M 1 1 1 II M II I M 1 1 M M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill M M I 

1017 TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 1076 



250 



309 



CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 

I M 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 MM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1077 CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 113 6 



310 



369 



CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 

I Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I MM 

1137 CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 1196 



Qy 3 70 AAATCTATGATAAATTCAAGGGC^CCCCCCACAGCTTATATATC 42 9 

IMIIIIIII II MINI I I llllll IIMIIII I IIMIIII M I 

Db 1197 AAATCTATGACAAGTTCAAGC^GAGT^ 12 5 6 

Qy 4 30 AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 48 9 

MMMI IN" M Mill II MMMM IIMIIIMIMII lllllll 

Db 12 57 AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 1316 

Qy 4 90 GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 9 

1 1 ! i 1 1 1 1 . 1 1 , i 1 1 ; 1 1 . 1 1 . 1 1 1 : i iiiiiMMiiiiiiii iiiiii 

Db 1317 GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCT 1376 

Qy 55 0 GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 609 

MM I M 1 1 1 M 11 1 M 1 1 M 1 1f M ! Mill lllll II MUM I II MM 

Db 13 77 GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 1436 

Qy 610 ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 669 

1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III Ml Ml II II Mill MMMI 

Db 143 7 ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 14 96 

Qy 67 0 TCAGTGCCCACTGTTCCTGTGACAGCAAAGATA^ 729 

I II IIMIIII 1 1 1 : .' 1 1 1 1 1 1 1 ! , 1 1 1 1 1 1 1 1 j M Mill II MMMI 

Db 14 97 TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGT 1556 

Qy 73 0 TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 78 9 

III I I IIMIIII MIMMIIIIIIIIIIIM IMMIMMIMI lllllll 

Db 1557 TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 1616 

Qy 79 0 TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 9 

I MIMIIIIMIIMIIIIIIIIIMIIIIII Mill 1 1 ! 1 1 1 1 1 1 1 1 M 1 1 1 1 1 

Db 1617 TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 1675 

Qy 850 CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 909 

IIMIIII IIMIIII MM 

Db 167 6 GCCCTGGACACCAACTATTGCT 1697 

Qy 910 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 969 

M 1 1 1 1 1 1 1 1 1 i I ! 1 1 ! 1 1 1 1 1 1 1 1 1 ! i I i 1 1 1 1 1 i I MMMMIIMM MMMI 

Db 16 98 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACC 1757 

Qy 97 0 TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 102 9 

I MMMMIIMM II II MIMMMMMMIMM MMMM MMMI 

Db 17 58 TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCT 1817 

Qy 103 0 GTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 108 9 

I MMMM MMMM Mill II II 1 1 II I II I II II II 1 1 1 1 1 1 1 ! 1 1 1 1 1 < I 

Db 1818 GCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGC 1877 

Qy 1090 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 9 

I M II 1 1 1 M II MMIMMMMMMIMIIIMIIIIIIIMMII MMMI 

Db 18 78 ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCA 1937 

Qy 1150 TCGTGTACTACGTGGGCCGC7VAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 1209 

1 1 ii 1 1 1 ii 1 1 1 1 r m 1 1 1 1 1 m 1 1 1 1 m i ii r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 m r 1 1 1 1 1 r 1 1 i 

Db 193 8 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCT 1997 

Qy 1210 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 12 6 9 



II 



1 1 1 1 



MINIMI II 



I I I Mill I 



Db 


1998 


CCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCCCAC 


2057 


Qy 


1270 


CCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACATCGTGCCCCAAG 


1321 


Db 


2058 


III MM Ml 1 1 II MMMMMMMMMI MMMMMI 

CCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCGTGCCCCAAG 


2117 


Qy 


1322 


CCCACTTGGGATCGATTAAA 1341 




Db 


2118 


MMI IMI 1 1 1 1 
CCCACCTGGGGCCCCATTAA 2137 





RESULT 9 

OATGFB1 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



CDS 



Chordata; Craniata; Vertebrata; Euteleostomi ; 
Cetart lodactyla ; Ruminantia; Pecora; Bovoidea; 
Ovis . 



mat^peptide 



OATGFB1 1173 bp mRNA linear MAM 18 -APR- 1995 

O.aries mRNA for transforming growth factor-beta I. 

X76916 

X76916. 1 GI :496648 

TGF-beta 1; transforming growth factor-beta 1. 
Ovis aries (sheep) 
Ovis aries 
Eukaryota ; Metazoa , 
Mammalia; Eutheria , 
Bovidae ; Capr inae ; 
1 

Woodall , C. J. , McLaren,L.J. and Watt, N.J. 

Sequence and chromosomal localisation of the gene encoding ovine 
latent transforming growth factor-beta 1 
Gene 150 (2), 371-373 (1994) 
95121932 
7821809 

2 (bases 1 to 1173) 
Woodall , C. 
Direct Submission 
Submitted (24 -DEC-1993) C. 
Veterinary Pathology, Sc. 
Edinburgh EH 9 IQH, UK 

Location/Qual if iers 

1. .1173 

/organism="Ovis aries" 
/mol_type="mRNA" 
/db_xref ="taxon: 9940" 
1. .1173 
/codon__start = l 

/product = 11 transforming growth factor-beta 1" 
/protein_id="CAA54242 . 1" 
/db_xref="GI : 496649" 
/db_xre f = " SWI SS - PROT : P5 04 14 " 

/translation=" MPPSGLRLLPLLLPLLWLLMLTPGRPVAGLSTCKTI DMELVKRK 
GIEAIRGQILSKLRLASPPSQGDVPPGPLPEAILALYNSTRDRVAGESAETEPEPEAD 
YYAKEVTRVLMVEYGNKI YDKMKSSSHSI YMFFNTSELREAVPEPVLLSRADVRLLRL 
KLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTHREEIEGFR 
LSAHCSCDSKDNTLQVDINGFSSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRH 
RRALDTNYCFSSTEKNCC^RQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS " 
835. .1170 

/product=" transforming growth factor-beta 1" 



Woodall, Univ. of Edinburgh, Dept. of 
of Vet. Studies, Univ. of Edinburgh, 



BASE COUNT 245 a 378 c 336 g 214 t 

ORIGIN 

Query Match 73.0%; Score 988; DB 4; Length 1173; 

Best Local Similarity 89.8%; Pred . No. 1.2e-164; 

Matches 1088; Conservative 0; Mismatches 85; Indels 39; Gaps 



Qy 


15 


Db 


1 


Qy 


75 


Db 


61 


Qy 


135 


Db 


121 


Qy 


195 


Db 


181 


Qy 


255 


Db 


241 


Qy 


315 


Db 


301 


Qy 


375 


Db 


361 


Qy 


435 


Db 


421 


Qy 


495 


Db 


481 


Qy 


555 


Db 


541 


Qy 


615 


Db 


601 


Qy 


675 


Db 


661 


Qy 


735 



lllllllllllllllllll IIIIIIMIMIIMIIIIIIIIIIIIIIIIIIIIMIII 

CTGACGCCTGGCCGGCCGGTCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 
GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 

Mllllllllil MIMMMMM Mill MINI lllllll MINIM Ml 



MUM 1 1 1 1 1 1 1 1 1 1 1 1 1 , 1 1 f 1 1 1 f 1 1 1 1 1 1 1 1 MM lllllll MINIMI 

TACAACAGTACCCGCGACCGGGTGGCCGGGGAAAGTGCCGAAACGGAGCCTGAGCCAGAG 
GCGGACTACTACGCC^GGAGGTC^CCCGCGTGCTAATGGTGGAAAGCGGC^ACCAAATC 

1 1 1 ! 1 1 1 ! 1 1 1 1 If 1 1 1 1 1 1 1 1 1 1 M If 1 1 1 1 1 1 1 1 1 ! I M 1 1 1 1 lllllll Mill 



MM MINIM MMMMMMIMM I ! I M 1 1! ! I f 1 1 If 1 1 

.TGCCTGAACCTGTGTTGCTCTCTCGGGCAGACGTGCGCCTGCTGAGGCTC 
TGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 

M 1 1 1 M M 1 1 M I M M I MMIMIMI Mill NNINNII 



M 1 1 1 1 1 1 1 1 M I M 1 1, M 1 1 II 1 M MNMM INM II IIIIMIIIMI 

ACTGGAGTTGTGCGGCAGTGGCTGACCCACAGAGAGGAAATAGAAGGCTTTCGCCTCAGT 
GCCC^CTGTTCCTGTGACAGO^GATAAC^CACTCCACGTGGAAATTAACGGGTTC^T 

NNNIIMNIMIMM II lllllll! II II INN II IMIIIIMI I 

GCCC^CTGTTCCTGTGAC^GTAAGGATAAC^CGCTTCAAGTGGAC^Ta^CGGGTTCAGT 
TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 

N II N II N M M II II I I M II M 1 1 1 1 M 1 1 ! M i M M M [ 1 1 ( 1 1 1 1 II 1 1 M 



Db 



721 TCCGGCCGCCGGGGTGACCTCGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 78 0 



Qy 7 95 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 8 54 

IIIIIIMIII IIIIMIMMMIIMIIIIIMIMIIIM IIIIIMM 

Db 781 ATGGCCACCCCTCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGCCACCGCCGA- 834 

Qy 8 55 GATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGC 914 

1 1 II (1 1 1 I M 1 1 1 1 II 1 1 1 1 1 1 1 ! I 

Db 83 5 GCCCTGGACACCAACTACTGCTTCAGC 861 

Qy 915 TCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGC 974 

Mill II IIIIIIMIII II II 1 , 1 1 1 1 1 1 1 1 1 ; 1 1 1 , 1 1 1 1 1 1 1 i 1 1 1 ! 1 1 1 1 1 

Db 8 62 TCCACAGAAAAGAACTGCTGTGTTCGTCAGCTCTACATTGACTTCCGGAAGGACCTGGGC 921 

Qy 975 TGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCC 1034 

M 1 1 1 1 1 1 1 1 1 1 1 1 MMMMMIMMM II M II M M I M M I M M M M M I 

Db 922 TGGAAGTGGATTCACGAACCCAAGGGCTACCACGCCAATTTCTGCCTGGGGCCCTGTCCC 981 

Qy 1035 TACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAAC 1094 

1 1 1 1 ! 1 1111,11 1 Mill 1 1 , , 1 1 1 1 1 1 Mill! 1 1 1 1 1 1 1 ! 1 1 1 1 i 1 1 1 1 1 1 1 1 ! 

Db 982 TACATCTGGAGCCTGGACACACAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCACAAC 1041 

Qy 10 95 CCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTG 1154 

Miiiiii 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 ; 1 1 1 ; i n iiimmimm 

Db 1042 CCGGGCGCATCGGCGGCGCCGTGCTGCGTGCCTCAGGCGCTGGAACCCCTGCCCATCGTG 1101 

Qy 1155 TACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGC 1214 

I M 1 1 M t M M 1 1 1 1 1 M M 1 1 M I M 1 1 1 1 1 1 1 II II I M 1 1 1 1 1 1 1 II I MMM 

Db 1102 TACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGTTGTCCAACATGATCGTGCGCTCCTGC 1161 

Qy 1215 AAGTGCAGCTGA 122 6 

i r 1 1 m 1 1 1 1 r i 

Db 1162 AAGTGCAGCTGA 1173 



RESULT 10 

BC000125 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



REMARK 
COMMENT 



BC000125 1780 bp mRNA linear PRI 12-JUL-2001 

Homo sapiens, Similar to transforming growth factor, beta 1, clone 
MGC:3119 IMAGE:3351664, mRNA, complete cds . 
BC000125 

BC000125.1 GI :12652748 
MGC. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 1780) 
Strausberg,R. 
Direct Submission 

Submitted (03 -NOV-2000) National Institutes of Health, Mammalian 
Gene Collection (MGC) , Cancer Genomics Office, National Cancer 
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, 
USA 

NIH-MGC Project URL: http://mgc.nci.nih.gov 
Contact : MGC help desk 
Email : cgapbs -r@mail .nih.gov 



Craniata; Vertebrata; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



Tissue Procurement: ATCC 

cDNA Library Preparation: Rubin Laboratory 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Genome Sequence Centre, 

BC Cancer Agency, Vancouver, BC, Canada 

inf o@bcgsc . be . ca 

Steven Jones, Jennifer Asano, Ian Bosdet, Yaron Butterfield, 
Susanna Chan, Readman Chiu, Chris Fjell, Erin Garland, Ran Guin, 
Letticia Hsiao, Martin Krzywinski, Reta Kutsche, Oliver Lee, Soo 
Sen Lee, Victor Ling, Carrie Mathewson, Candice McLeavy, Steven 
Ness, Pawan Pandoh, Anna-Liisa Prabhu, Parvaneh Saeedi , Jacqueline 
Schein, Duane Smailus, Michael Smith, Lorraine Spence, Jeff Stott, 
Michael Thorne, Miranada Tsai, Natasja van den Bosch, Jill Vardy, 
George Yang, Scott Zuyderduyn, Marco Marra. 

Clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consort ium/LLNL at: http://image.llnl.gov 
Series: IRAL Plate: 6 Row: e Column: 11 

This clone was selected for full length sequencing because it 
passed the following selection criteria: Hexamer frequency ORF 
analysis . 

FEATURES Loca t ion/ Qua 1 i f i ers 

source l. .178 0 

/organism="Homo sapiens" 

/mol_type="mRNA " 

/db_xref ="taxon: 9606" 

/clone="MGC:3119 IMAGE: 3351664 11 

/ t issue_type= " Eye , retinoblastoma " 

/ c 1 one_l ib= " N I H_MGC_1 6 " 

/lab_host="DH10B-R" 

/not e= "Vector : p0TB7 " 
CDS 447. .1619 

/codon_start=l 

/product="Similar to transforming growth factor, beta 1" 
/protein_id="AAH00125. 1" 
/db_xref="GI : 12652749" 

/ 1 rans lat ion= " MP PSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRK 
RIEAIRGQILSKLRLASPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEAD 
YYAKEVTRVLMVETHNEI YDKFKQSTHSI YMFFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEWLSFDVTGVVRQWLSRGGEIEGFR 
LSAHCSCDSRDNTLQVDINGFTTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS " 

BASE COUNT 341 a 64 8 c 4 93 g 2 98 t 

ORIGIN 

Query Match 73.0%; Score 987.6; DB 9; Length 1780; 

Best Local Similarity 85.4%; Pred . No. 1.4e-164; 

Matches 1144; Conservative 0; Mismatches 149; Indels 47; Gaps 2; 

Qy 10 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 69 

II III I I I I II lllllllllll III lllilllll IIIIIIIIIMIII I 
Db 442 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCTGCTGCTGCTACCGCTGCTGTGGCTAC 5 01 

Qy 70 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 9 

I 1 1 1 1 1 1 ! M 1 1 1 M 1 1 1 T I II 1 1 Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MINIM, I 

Db 502 TGGTGCTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 561 



Qy 


130 


Db 


562 


Qy 


190 


Db 


622 


Qy 


250 


Db 


682 


Qy 


310 


Db 


742 


Qy 


370 


Db 


802 


Qy 


430 


Db 


862 


Qy 


490 


Db 


922 


Qy 


550 


Db 


982 


Qy 


610 


Db 


1042 


Qy 


670 


Db 


1102 


Qy 


730 


Db 


1162 


Qy 


790 


Db 


1222 


Qy 


850 


Db 


1281 


Qy 


910 


Db 


1303 



AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 9 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 Ml 1 1 1 lllllllllll MINIUM MM 

AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 621 
TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 9 

I MMMMIMMMMMMM 1 1 II 1 1 1 1 1 II II M I M II Mill II II I 

TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 681 
CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 309 

I M llllllll 1 1 r 1 1 1 1 1 1 1 1 1 1 1 MINIM MM 1 . 1 1 1 ! M 1 1 1 1 1 1 ! I 

CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACGGGAGCCCGAGC 741 
CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 369 

I INN 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IIIIIIIIIIIIIIIII 1 1 1 1 , I INI 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 8 01 
AAATGTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 42 9 

Ml I II II MINI I I Mill llllllll I llllllll II I 

AAATCTATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAG 861 
AGCTCCGGGAAGGGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 8 9 

IIMIII llllllll II INN II IIMIII I I I MMM Mill 

AGCTGCGAGAAGCGGTACCTGAAGCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 921 
GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 9 

I III M I I I I I 1 M M I I I I ; I ; I I I I I I M i llllllll I ! I I I I I I MM 

GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCT 981 
GG CGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 60 9 

INI I M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 Ml I Mill INN II WWW I II INI 

GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 1041 
ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 669 

1 1 1 MM I II I II I III 1 1 1 M 1 1 II III III III II II INN IIMIII 

ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 1101 
TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 729 

I II llllllll MMM I I lllllllllll II I II 1 1 II lllllll 

TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGT 1161 
TCAATTCTGGCGGCCGGGGTGACGTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 78 9 

IN I I llllllll 1 1 1 1 1 1 1 1 ! M i I 1 1 1 1 1 1 NINNNINN MMM 

TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 1221 



I I M II II II II I M I II 1 1 1 M MM II II 1 1 INN 1 1 II II 1 1 II 1 1 1 1 1 1 II 



1280 



CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 90 9 

MMI I MMM INI 

GCCCTGGACACCAACTATTGCT 13 02 

TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 969 

^ M 1 1 Mil III 1 1 II M II I, I MM II II 1 1 IM NINNNINN lllllll 

TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACC 13 62 



Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 



970 TGGG CTGGAAGTGGATT CATG AA CCCAAGGG CTAC CATG C CAATTT CTG C CTGGGG CC CT 102 9 

I 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 II II IIIIIIMIIIIIIIMIM Mill ll;hl 

13 63 TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCT 1422 
1030 GTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 108 9 

I Mill I Mill I Mill I II 1 1 1 1 1 1 1 1 1 1 IN ! II I 1 1 II 1 1 1 1 1 1 1 1 1 

14 23 GCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGC 14 82 
1090 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 9 

I MIIIMIIII 1 1 II M II M M 1 1 1 1 M II II I II M M II 1 1 1 M 1 1 IMI.il 

14 8 3 ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCA 154 2 
115 0 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 12 0 9 

■|IM I llllil III ll'IIIMII Ml II II I IMhli li: III I'l I'lMI ' I 

1543 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCT 1602 
1210 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 1269 

MIIMMIIIMIMIII MIIIIIIMM Ml II I I I Mill I 

1603 CCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCGGCCCCGCCCCGGCAGGCCCGGCCCCAC 1662 
1270 CCCACCCCCGCCCGCCT CAC CGGGG GTGTATTTAAGG A CAT CGTG C C C CAAG 1321 

Ml MM Ml M 1 1 MMMMMMIMMM I MMM 

1663 CCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCCGTGCCCAAG 1722 

1322 CCCACTTGGGATCGATTAAA 1341 

Mill MM I I II 
1723 CCCACCTGGGGCCCCATTAA 1742 



RESULT 11 

BC001180 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



REMARK 
COMMENT 



BC001180 1780 bp mRNA linear PRI 12-JUL-2001 

Homo sapiens, Similar to transforming growth factor, beta 1, clone 
MGC:2323 IMAGE: 3356605, mRNA, complete cds . 
BC001180 

BC00118 0 . 1 GI : 12654682 
MGC . 

Homo sapiens (human) 
Homo sapiens 

Eukaryota,- Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 1780) 
Strausberg, R. 
Direct Submission 

Submitted (ll-DEC-2000) National Institutes of Health, Mammalian 
Gene Collection (MGC) , Cancer Genomics Office, National Cancer 
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, 
USA 

NIH-MGC Project URL: http://mgc.nci.mh.gov 
Contact : MGC help desk 
Email : cgapbs-r@mail .nih.gov 
Tissue Procurement: ATCC 

cDNA Library Preparation: Rubin Laboratory 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Genome Sequence Centre, 

BC Cancer Agency, Vancouver, BC, Canada 

inf o@bcgsc . be . ca 



Steven Jones, Jennifer Asano, Ian Bosdet, Yaron Butterf ield, 
Susanna Chan, Readman Chiu, Chris Fjell, Erin Garland, Ran Guin, 
Letticia Hsiao, Martin Krzywinski, Reta Kutsche, Oliver Lee, Soo 
Sen Lee, Victor Ling, Carrie Mathewson, Candice McLeavy, Steven 
Ness, Pawan Pandoh, Anna-Liisa Prabhu, Parvaneh Saeedi, Jacqueline 
Schein, Duane Smailus, Michael Smith, Lorraine Spence, Jeff Stott, 
Michael Thorne, Miranada Tsai, Natasja van den Bosch, Jill Vardy, 
George Yang, Scott Zuyderduyn, Marco Marra . 



FEATURES 

source 



CDS 



BASE COUNT 
ORIGIN 



Clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consort ium/LLNL at: http://image.llnl.gov 
Series: IRAL Plate: 6 Row: e Column: 10 

This clone was selected for full length sequencing because it 
passed the following selection criteria: matched mRNA gi : 37097. 

Location/Qualif iers 

1. .1780 

/organism="Homo sapiens" 

/ mo 1 _t ype = " mRNA " 

/dbjcref ="taxon: 9606" 

/clone="MGC:2323 IMAGE : 3356605 " 

/ 1 i s sue_type= " Eye , ret inoblas toma " 

/ c 1 one_l i b= n N I H_MGC_1 6 " 

/lab_host="DH10B-R" 

/note= "Vector : pOTB7 " 

447. .1619 

/codon_start=l 

/product="Similar to transforming growth factor, beta 1" 
/protein_id="AAH01180 . 1" 
/db_xref="GI : 12654683" 

/ trans la tion= "MPPSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRK 
RIEAIRGQILSKLRLASPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEAD 
YYAKEVTRVLMVETHNE I YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEWLSFDVTGVVRQWLSRGGEIEGFR 
LSAHCSCDSRDNTLQVDI NGFTTGRRGDLATI HGMNRPFLLLMATPLERAQHLQSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHN PGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS " 
341 a 648 c 493 g 298 t 



Query Match 73.0%; Score 987.6; DB 9; Length 1780; 

Best Local Similarity 85.4%; Pred. No. 1.4e-164; 

Matches 1144; Conservative 0; Mismatches 149; Indels 47; 



Gaps 



2; 



Qy 

Db 

Qy 

Db 
Qy 

Db 

Qy 



1 0 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 6 9 

II III 1 1 1 1 II 1 1 1 1 1 1 . 1 , I ' III MINIMI I ' ' 1 1 1 1 1 1 ! 1 1 1 1 I 

44 2 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCTGCTGCTGCTACCGCTGCTGTGGCTAC 501 
70 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGC^AGACCATCGACATGG 12 9 

I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IMMMMI 

502 TGGTGCTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 561 
13 0 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 9 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 j 1 1 1 1 MMIMMM MM 

562 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 621 
190 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 9 

I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill II II I 



Db 


622 


TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 


681 


Qy 


250 


CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 


309 


Db 


682 


1 M IIIIIIII MIMIIMIIMI IIIIIIII MM MMIIIIIIMIMI 

CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 


741 


Qy 
Db 


310 
742 


CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 

1 MMI II II 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 MM 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 


369 
801 


Qy 


370 


AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 

IIIIIIIIII II MUM 1 1 IIIIII IIIIIIII 1 IIIIIIII 1 1 1 

AAATCTATGACAAGTTCAAGCAGAGTACACAC^GCATATATATGTTCTTCAACACATCAG 


429 


Db 


802 


861 


Qy 


430 


AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 


489 


Db 


862 


MIMM MMMM M MMI M IMMMI 1 II 1 1 M M M II 1 lllllll 

AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 


921 


Qy 
Db 


490 
922 


GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 

IIIIIIIIIIIIIMIIIIIIIIIIIIIIIIMI IMMMMMMMII IIIIII 

GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCT 


549 
981 


Qy 


550 


GGCGCTACCTCAGCAACCGGCTGCTGGCCCGCAGTGACTCACCGGAGTGGCTGTCCTTTG 

MM 1 . 1 1 M 1 1 1 1 1 ' 1 1 ' 1 1 1 1 1 1 1 Mill Mill II IIIIII 1 II MM 

GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 


609 


Db 


982 


1041 


Qy 


610 


ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 

M M M M M 1 1 M M II M M M 1 1 Ml Ml Ml II 1 1 Mill MIMM 

ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 


669 


Db 


1042 


1101 


Qy 
Db 


670 
1102 


TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 

1 M IIIIIIII IMIIIIIIMII lllllllllll II Mill M MIMM 

TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGT 


729 
1161 


Qy 

Db 


730 
1162 


TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAAGCGGCCCTTCCTGC 

III 1 1 IMMMI i 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 ] f I llllllllllllll MIMM 

TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 


789 
1221 


Qy 

Db 


790 
1222 


TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 

1 M M II 1 1 II II 1 II M 1 1 M 1 1 II 1 1 II 1 II Mill MMIIMMMIIMM 

TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 


849 
1280 


Qy 


850 


CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 

MMMM IIIIIIII MM 

GCCCTGGACACCAACTATTGCT 


909 


Db 


1281 


1302 


Qy 


910 


TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 


969 


Db 


1303 


1 1 1 1 1 II 1 II II II 1 1 II 1 1 1 M 1 II 1 1 1 M II 1 1 1 1 II II M II M II M MMIM 

TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACC 


1362 


Qy 


970 


TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 

1 1 1 1 1 1 : 1 ! 1 1 1 II 1 II II IIIIIIIIIIIIIMMMI MMMM lllllll 

TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCT 


1029 


Db 


1363 


1422 


Qy 


1030 


GTCCCTACATCTGGAGCCTAGAC^CTC^GTAC^GCAAGGTCCTGGCTCTGTACAACCAGC 

1 IIIIIIII IMMMI Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 lllllllllllll 

GCCCCTACATTTGGAGCCTGGAC^CGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGC 


1089 


Db 


1423 


1482 



Qy 1090 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 9 

I 11111,111 IIIMIIIIIMIMIIIIIIIIIIIMIIIIMIIII 'Mill 

Db 14 83 ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCA 1542 

Qy 1150 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGGTGTCCAACATGATCGTGCGTT 12 09 

MMMMM IMIMMMMMI MMMM MMMIIMMMIII MM I 

Db 154 3 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCT 1602 

Qy 1210 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 1269 

MMMMMMMMMI MMMMMM MINIM MM! I 

Db 1603 CCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCCCAC 1662 
Qy 1270 CCCAGCCCCGCGCGGCT CACCGGGGCTGTATTTAAGGACATCGTGCCCCAAG 1321 

III 1 1 1 1 Ml II II MMMMMMMMM I 1 1 1 1 1 

Db 1663 CCGGGCCCG CCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCCGTGCGCAAG 1722 

Qy 13 22 CCCACTTGGGATCGATTAAA 1341 

Mill I II I I I II 
Db 1723 GCCACCTGGGGCCCCATTAA 1742 



RESULT 12 

AGMTGFB 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
PUBMED 
COMMENT 

FEATURES 

source 



CDS 



AGMTGFB 1561 bp mRNA linear PRI 27-APR-1993 

Simian transforming growth factor-beta (TGF) mRNA, complete cds . 
M16658 

M16658 .1 GI : 176552 

growth factor; transforming growth factor-beta. 
Cercopithecus aethiops (African green monkey) 
Cercopithecus aethiops 

Eukaryota; Metazoa; Chordata; Crania ta; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 
Cercopithecinae; Cercopithecus . 
1 (bases 1 to 1561) 

Sharpies, K., Plowman , G . D . , Rose,T.M., Twardzik, D. R . and 
Purchio,A.F. 

Cloning and sequence analysis of simian transforming growth 

factor-beta cDNA 

DNA 6 (3) , 239-244 (1987) 

87246074 

3474130 

Original source text: African green monkey cells (cell line 
BSC-40) , cDNA to mRNA, clone pTGF-beta-2. 

Location/Qualifiers 

1. .1561 

/organism= M Cercopithecus aethiops " 
/mol_type= "mRNA" 
/db_xref ="taxon: 9534" 
262. .1434 

/note="transformmg growth factor-beta precursor" 
/codon_start=l 
/protein_id="AAA35369 . 1" 
/db_xref= n GI : 176553" 

/translation "MPPSGLRLLPLLLPLLWLLVLTPSRPAAGLSTCKTIDMELVKRK 
R I ETIRGQI LSKLRLASPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEAD 
YYAKEVTRVLMVETHNEI YDKFKQSTHSI YMFFNTSELREAVPEPVXiLSRAELRLLRL 



KLKVEQHVELYQKYSNNSWRYLSNRLIAPSNS PEWLSFDVTG WRQWLSRGGE I EGFR 
LSAHGSCDSKDNTLQVDINGFTTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRH 
RRALDTNYCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYIWSLDT 
QYSKVIALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS " 
sig_peptide 262. .1095 

/note=" transforming growth factor-beta signal peptide" 
mat_peptide 1096. .1431 

/product= M transforming growth factor-beta" 

BASE COUNT 301 a 547 c 446 g 267 t 

ORIGIN 

Query Match 72.9%; Score 986.2; DB 9; Length 1561; 

Best Local Similarity 85.8%; Pred. No. 2.4e-164; 

Matches 1150; Conservative 0; Mismatches 143; Indels 48; Gaps 3; 

CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 6 9 

II III II 1 1 II MINIM lllllllllllll Mill I 

CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 316 
TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 9 

I Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill Mill Ml I 1 1 1 1 1 1 1 1 1 1 

TGGTGCTGACGCCTAGCCGGCCGGCCGCAGGACTATCCACCTGCAAGACTATCGACATGG 376 
AG CTGGTG AAG CGG AAG CG CAT CGAGG CCATT CG CGG C CAGATTCTGT C CAAG CTT CGG C 189 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I Mill MM III Ml 1 1 1 1 

AG CTGGTGAAG CGGAAG CG CAT CG AGA C CAT C CG CGG C CAGATCCTGT C CAAG CTG CGG C 4 36 

TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 9 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I Mill M II I 
TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 4 96 

CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 09 

I II MMMM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MM II MMMIMM I 

CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCGGAGCCGGAGCCCGAAC 5 56 
CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 369 

I Mill M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 i 1 1 1 M I f 1 1 1 I MM 

CGGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 616 
AAATCTATGATAAATTC^AGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 429 

1 1 1 1 1 1 1 1 1 1 II MUM I I I MM MMMM I 1 1 1 1 1 1 1 1 II I 

AAATCTATGACAAGTTCAAGCAGAGCACA.CACAGCATA 676 
AG CT CCGGG AAG CGG TG C CGG AA C CTG TATTG CTCT CTCGGG CAGAG CTG CG C CTG CTGA 4 89 

I M II M Mill II II MMMM MMMM 1 1 1 1 1 1 1 M 1 1 1 1 1 MIMM 

AG CTCCGAGAAGCAGTACCTGAAC CTG TGTTGCTCTCCCGGG CAGAG CTG CGTCTG CTGA 736 
GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 9 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I M 1 1 1 



Qy 


10 


Db 


257 


Qy 


70 


Db 


317 


Qy 


130 


Db 


377 


Qy 


190 


Db 


437 


Qy 


250 


Db 


497 


Qy 


310 


Db 


557 


Qy 


370 


Db 


617 


Qy 


430 


Db 


677 


Qy 


490 


Db 


737 


Qy 


550 


Db 


797 


Qy 


610 



MINIM III MINIMI MINI III 1 1 1 1 III II II Mill 1 1 1 



Db 8 57 ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGCGGAGGGGAAATTGAGGGCTTTCGCC 916 

Qy 67 0 TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 72 9 

I M Illlllll 1 1 1 II 1 1 1 M II II 1 1 1 1 II II II 1 1 1 1 Mill II lllllll 

Db 917 TTAGCGCCCACTGGTCCTGTGACAGCAAAGATAAC^CACTGC^AGTGGACATCAACGGGT 976 

Qy 73 0 TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 78 9 

III I I Illlllll IMIIIMIIIIII Mill I ' 1 1 1 1 1 1 1 1 ! 1 1 lllllll 

Db 977 TCACTACCGGGCGCCGAGGTGACCTGGCCACAATTCATGGCATGAACCGGCCTTTCCTGC 1036 

Qy 7 90 TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCAGCTGCACAGCTCCGGGCACCGCCGAG 84 9 

i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 iiiiiiiiiiiii ii inn [ r 1 1 1 1 1 1 1 1 1 r I I I I I I 

Db 1037 TTCTCATGGCCACGCCACTGGAGAGGGCCCAACATCTGCAAAGCTCCCGGCACCGCGGA- 1095 

Qy 850 CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 90 9 

Illlllll lllllllllllll 
Db 1096 GCCCTGGACACCAACTACTGCT 1117 

Qy 910 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 969 

I MM I MM i MMMI I I M || lllllilllll lllllll 

Db 1118 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTATATTGACTTCCGCAAGGACC 1177 

Qy 970 TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 102 9 

I MMMMMMM II II 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 

Db 1178 TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTGGGGCCCT 123 7 

Qy 103 0 GTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 108 9 

Illlllllll Illlllll Mill II II II M II M I M II M I IMMMIIIMI 

Db 1238 GTCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGC 12 97 

Qy 1090 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 9 

I lllllilllll MM M MM M M M MM M MM M M M M M M M M M M 

Db 12 98 ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 1357 

Qy 115 0 TCGTGTACTACGTGGGCCG(^GCC(^GGTGGAGCAGCTGTCCAACATGATCGTGCGTT 1209 

I M MM I M MM MM M MM M M M MM M Ml MM M M M I M M M M I 

Db 1358 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCT 1417 

Qy 1210 CCTGC^AGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 1269 

iiiiiii 1 1 1 1 f 1 1 1 1 1 r 1 1 1 r 1 1 1 1 1 1 1 1 1 iMiiiii mil 1 1 

Db 1418 CCTGCAAATGCAGCTGAGGCCCCGCCCCGCCCCGCCCCACCCCGGCAGGCCCGGCCCCGC 14 77 
Qy 127 0 CCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACA-TCGTGCCCCAA 132 0 

Illlllll III II I IMIIIIIIMMIIMII MMIIIIII 

Db 14 78 CCCACCCCACCCCCGCTGTCTTGCCCTTGGGGGCTGTATTTAAGGACACCCGTGCCCCAA 1537 

Qy 1321 GCCCACTTGGGATCGATTAAA 1341 

MMII MM I I II 

Db 1538 GCCCACCTGGGGCCCCATTAA 15 58 



RESULT 13 
E03028 

LOCUS E03028 1821 bp RNA linear PAT 29-SEP-1997 

DEFINITION DNA encoding human prepro TGF-betal . 

ACCESSION E03028 

VERSION E03028.1 GI:2171250 



KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 
JOURNAL, 

COMMENT 



JP 1991180192-A/l. 
Homo sapiens (human) 
Homo sapiens 

Eukaryota ; Metazoa ; Chordata ; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 1821) 
Ohashi,H., Ishii,Y., Miyata,Y 
Takaku, F . 

PRODUCTION OF HUMAN PRO-TGF-BETA1 BY GENETIC RECOMBINATION 
Patent: JP 1991180192-A 1 06-AUG-1991; 
KIRIN BREWERY CO LTD 



Craniata ; Vertebrata ; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 

, Miyazono,K., Miyagawa,K. and 



Homo sapiens (human) 
JP 1991180192-A/l 

06- AUG-1991 

07- DEC-1989 JP 1989318243 
OHASHI HIDEYA, ISHII YASUYUKI 



OS 
PN 
PD 
PF 
PI 

KOHEI 
PI 
PC 
PC 
PC 
CC 
CC 
CC 
CC 
CC 
CC 
FH 
FH 
FT 
FT 
FT 

sig_peptide 
FT 
FT 
FT 
FT 
FT 



MIYATA YOSHINORI, PI MIYAZONO 



MIYAGAWA KIYOSHI, TAKAKU FUMIMARO 

C12P2l/00,C07K13/00,C12N5/10, C12N15/ 18 / /C12Q1/68 , (C12P21/00, 
C12R1:91) , 

(C12N15/18,C12R1 : 91) ; 
strandedness : Double; 
topology: Linear; 
hypothetical: No; 
anti-sense: No; 

*source : tissue__type=placenta ; 
clone=pVC19-TGF; 

Location/Qual if iers 



♦source : 
Key 



5 'UTR 
CDS 



mat_peptide 
mat_pept ide 



1. .510 
511. .1683 

/product = 'human prepro TGF-betal' FT 
511. .597 

598. .1680 

/note= 'human pro TGF-betal' 
1345. .1680 

/product = 'human TGF-betal' 
1684. .1820. 



FEATURES 

source 



BASE COUNT 
ORIGIN 



3 'UTR 

Loca t ion/Qua 1 i f iers 
1. .1821 

/organism="Homo sapiens" 
/mol_type=" genomic RNA" 
/db_xref ="taxon: 9606" 
326 a 679 c 508 g 308 



Query Match 72.7%; 
Best Local Similarity 85.8%; 
Matches 1135; Conservative 



Score 983.4; DB 6 
Pred. No. 7.4e-164 
0; Mismatches 141 



Length 1821; 
Indels 47; Gaps 



2; 



Qy 



Db 



1 0 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 69 

II III 1 1 1 1 II M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 '1 1 1 1 : 1 i I Ml I II I II I I 

5 07 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 566 



Qy 7 0 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 9 

I 1 1 1 II I III 1 1 : 1 II I MINIMI Mill I I "MM IMIIIMM 

Db 567 TGGTGCTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 626 



Qy 


130 


AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 

IIIIIMIIIIIMIIMIIIIIIIIIIIII IIIIIIIIIII IIIIIIIIIII MM 

AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 


189 


Db 


627 


686 


Qy 


190 


TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 

1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 llllllllllllllllllll Mill II II 1 
TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 


249 


Db 


687 


746 


Qy 


250 


CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 

1 II llllllll 1 1 1 1 1 1 1 1 II ! 'I'M MM IIIIIIIIMMIMI 

CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 


309 


Db 


747 


806 


Qy 


310 


CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 

1 Mill 1 1 1 il 1 M II 1 1 1 M 1 1 1 1 1 1 Ml 1 1 M 1 1 1 1 1 1 1 M II 1 Ml 1 MM 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 


369 


Db 


807 


866 


Qy 


370 


AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 

llllllll 1 II Mill' 1 1 MUM llllllll 1 llllllll II 1 

AAAT CTATGA CAAGTT CAAG CAGAGTA CA CA CAG CATATATATGTT CTT CAA CA CAT CAG 


429 


Db 


867 


926 


Qy 


430 


AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 

lllllll llllllll M Mill II 1 1 II II M llllllllllllll lllllll 
AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 


489 


Db 


927 


986 


Qy 
Db 


490 
987 


GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 

1 1 1 1 1 III 1 1 1 1 1 1 1 1 Ml 1 1 1 1 1 M 1 II 1 1 1 1 1 1 1 MM II 1 II 1 1 II 1 MUM 

GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCT 


549 
1046 


Qy 


550 


GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 

MM III 1 1 1 II 1 1 1 M 1 1 1 1 II 1 1 M Mill Mill M MUM 1 II MM 

GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 


609 


Db 


1047 


1106 


Qy 


610 


ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 

IIIIIIIIIIIIIIIIMIIMIMI III III III II II Mill lllllll 

ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 


669 


Db 


1107 


1166 


Qy 


670 


TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 

1 II llllllll lllllllllllll IMMIIMM II Mill II lllllll 
TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGT 


729 


Db 


1167 


1226 


Qy 
Db 


730 
1227 


TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 

Ml 1 1 llllllll llllllllllllllllllll lllllllllllll 1 lllllll 

TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 


789 
1286 


Qy 


790 


TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 
1 II 1 1 1 II 1 1 II 1 1 1 II 1 1 II 1 1 1 1 1 1 i 1 1 II 1 1 1 I 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 


849 


Db 


1287 


1 1 1 1 II II 1 1 II 1 1 II II 1 II II II 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 


1345 


Qy 


850 


CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 

llllllll llllllll MM 

GCCCTGGACACCAACTATTGCT 


909 


Db 


1346 


1367 


Qy 


910 


TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 

1 M 1 M II M 1 II II 1 1 1 II 1 1 1 II 1 1 II 1 1 1 1 II II IIIIIIIIMIIII lllllll 

TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACC 


969 


Db 


1368 


1427 



Qy 97 0 TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 102 9 

I MINIM Ml II II i I M MM M 1 1 1 M MM MMMM lllllll 

Db 1428 TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCT 1487 

Qy 103 0 GTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 1089 

I MMMM llllllll Mill III MMMM I I MMMMMMI 

Db 14 88 GCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGC 1547 

Qy 1090 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 9 

I MMMMMI M M M M M M M Ml M M M M M M M M M M I I MM 

Db 154 8 ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCA 1607 

Qy 1150 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 12 09 

IMMM MMMMMMMMMMMMMIMM MM MIMMMMI I 

Db 16 08 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCT 1667 

Qy 1210 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 1269 

MMMIMMMMMI MIMMMMI llllllll Mill I 

Db 1668 CCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCCCAC 1727 
Qy 127 0 CCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACATCGTGCCCCAAG 1321 

III MM III II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MMMM I 

Db 1728 CCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCGTGCCCCCAA 1787 

Qy 1322 CCC 1324 

II 

Db 1788 GCC 1790 



RESULT 14 

BC022242 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



REMARK 
COMMENT 



BC022242 1746 bp mRNA linear PRI 04-FEB-2002 

Homo sapiens, clone MGC: 22008 IMAGE : 4399762 , mRNA, complete cds . 
BC022242 

BC022242 .1 GI : 18490115 
MGC. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 1746) 

Strausberg,R. 

Direct Submission 

Submitted ( 01 -FEB-2002 ) National Institutes of Health, Mammalian 
Gene Collection (MGC) , Cancer Genomics Office, National Cancer 
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, 
USA 

NIH-MGC Project URL: http://mgc.nci.nih.gov 

Contact : MGC help desk 

Ema i 1 : cgapbs - r@ma i 1 . nih . gov 

Tissue Procurement: ATCC 

cDNA Library Preparation: Life Technologies, Inc. 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Sequencing Group at the Stanford Human Genome 

Center, Stanford University School of Medicine, Stanford, CA 94305 

Web site: http://www-shgc.stanford.edu 

Contact: (Dickson, Mark) mcd@paxil.stanford.edu 



Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers, 
R. M. 



FEATURES 

source 



CDS 



BASE COUNT 
ORIGIN 



Clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consort lum/LLNL at: http://image.llnl.gov 
Series: IRAK Plate: 27 Row: e Column: 21 

This clone was selected for full length sequencing because it 
passed the following selection criteria: Hexamer frequency ORF 
analysis . 

Loca t i on/ Qua 1 1 f i er s 
1. .1746 

/organism= "Homo sapiens " 

/mol_type=" mRNA " 

/db_xref="taxon: 9606" 

/clone="MGC: 22 008 IMAGE : 4 3 99762 " 

/ 1 i s sue_type= " Duodenum , adenoca rc inoma " 

/ c 1 one_l i b = " N I H_MGC_8 8 " 

/lab_host="DH10B" 

/note=" Vector: pCMV-SP0RT6" 

370. .1542 

/codon_start=l 

/product = "Unknown (protein for MGC:22008)" 
/protein_id= M AAH22242 .1" 
/db_xref="GI : 18490116" 

/ 1 rans 1 a t ion= " MPPSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRK 
RIEAIRGQILSKLRLASPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEAD 
YYAKEVTRVLMVETHNEIYDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEWLSFDVTGVVRQWLSRGGEIEGFR 
LSAHCSCDSRDNTLQVDINGFTTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS " 
376 a 612 c 472 g 286 t 



Query Match 72 . 7%; 

Best Local Similarity 85.6%; 
Matches 1148; Conservative 



Score 983; DB 9; Length 174 6; 
Pred. No. 8.8e-164; 
0; Mismatches 145; Indels 48; 



Gaps 



3; 



Qy 



Db 



1 0 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 69 

II III 1 1 II II IIIIIIMIII III MINIMI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 

3 65 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCTGCTGCTGCTACCGCTGCTGTGGCTAC 424 



Qy 
Db 

Qy 

Db 



7 0 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 9 

I Ml I M M M 1 1 II M I M 1 1 M I II Mill 1 1 1 1 1 1 1 M 1 1 1 1 1 II i 1 1 1 1 M I 

4 25 TGGTGCTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 4 84 



130 



189 



AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 

IIIMIIIMIIIIIIIIIIIIIIIIIIIII IMIIMIIII IIIIIIMIII MM 

4 85 AG C TGGTGAAG CGG AAG CGCATCGAGGCCATCCGCGGC CAG AT C CTG T C CAAG CTG CGG C 54 4 



Qy 

Db 



190 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 9 

I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IMMMMMMMMMI Mill II II I 

54 5 TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 6 04 



Qy 



Db 



2 50 CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 09 

I II r 1 1 1 1 1 1 1 MMMMMMII 1 1 1 1 1 1 1 1 MM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

6 05 CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 664 



Qy 


310 


Db 


665 


Qy 


370 


Db 


725 


Qy 


430 


Db 


785 


Qy 


490 


Db 


845 


Qy 


550 


Db 


905 


Qy 


610 


Db 


965 


Qy 


670 


Db 


1025 


Qy 


730 


Db 


1085 


Qy 


790 


Db 


1145 


Qy 


850 


Db 


1204 


Qy 


910 


Db 


1226 


Qy 


970 


Db 


1286 


Qy 


1030 


Db 


1346 


Qy 


1090 


Db 


1406 



CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAA.TGGTGGAAAGCGGCAACC 3 69 

I Mill 1 1 MM II 1 1 1 1 Ml II 1 1 : II I II 1 1 1 1 INI 1 1 1 1 1 1 1 1 1 I I II I 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 724 



MMMMM 1 1 IMMI I I MMM MINIM I llllllll II 



AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 8 9 

IM II llllllll II lllll II llllllll III' IMMI 

AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 844 

GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 9 

II I II 1 1 1 1 1 M 1 1 1 1 1 M M 1 1 1 II Ml M I M I MM 1 1 1 IM IM 1 1 IMMI 

GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCT 9 04 

GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 609 

MM I M M I IM II 1 1 1 1 1 1 II M I lllll lllll II IMMI I II 1 1 1 1 

GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 964 



II I II 1 1 1 1 1 1 1 1 1 1 ! M I M 1 1 1 M III III III II II Mill MINI 



TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 72 9 

I II llllllll MIIIIIIIIMI 1 1 1 1 1 1 1 1 1 1 1 II Mill II IMMI 

TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGT 1084 



II I I llllllll J ! 1 1 1 1 1 1 ' 1 1 1 1 1 1 1 1 I MMMMM MMM 



TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 8 4 9 

I 1 1 M 1 1 ' M I II I M M M 1 1 1 M M 1 1 Mill II 1 1 1 1 1 1 1 1 II 1 1 1 1 M 

TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 12 03 



I II I I I llllllll III 



TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 969 

II M M II M II II I II II II M 1 1 1 MM 1 1 1 M M II 1 1 1 1 1 1 1 1 1 1 1 1 IMMI 

TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACC 12 8 5 



MIIIIIIIIMI II II II II III 1 1 II I II M M 1 1 MM Ml IMMI 

IGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCT 
:CCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 

IMMI llllllll lllll MM II III II 1 1 M 1 1 M I MMMMIMM 

'CCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGC 
ACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 

MMMMM IIIIIIIIIIIIIIIIIIIIIIIIIIIMMMMIII MMM 



Qy 1150 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 12 0 9 

Mill III Ml Mill! IIIIIIIMI III II Ml II IMM MUM l! I III Ml I 

Db 14 66 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCT 152 5 

Qy 1210 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 1269 

1 1 II 1 1 1 1 II I II 1 1 1 1 1 1 IMIMI MM MINIM Mill I 

Db 1526 CCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCCCAC 1585 
Qy 12 7 0 CCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACA - TCGTGCCCCAA 132 0 

Ml 1 1 1 1 III II II MMMIMMIMI III I r I | | I | | | | 

Db 1586 CCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCCGTGCCCCAA 164 5 

Qy 1321 GCCCACTTGGGATCGATTAAA 1341 

MINI II I I I I II 
Db 164 6 GCCCACCTGGGGCCCCATTAA 1666 



15 



RESULT 
106216 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 

BASE COUNT 
ORIGIN 



106216 
Sequence 
106216 
106216. 1 



1560 bp 
I from Patent EP 0293785. 

GI : 590649 



DNA 



linear PAT 02-DEC-1994 



Unknown . 

Unknown . 

Unclassified . 

1 (bases 1 to 1560) 

Purchio , A . F . , Gentry, L. and Twardzik,D. 

Cloning and expression of simian transforming growth factor-SSI 
Patent: EP 0293785-A2 2 07-DEC-1988; 
Locat ion/Qua 1 if iers 



301 



1. .1560 

/ organi sm= " unknown " 
a 547 c 442 g 



267 t 



Query Match 72 . 6%; 

Best Local Similarity 85.7%; 
Matches 1145; Conservative 



Score 981.8; DB 6 
Pred. No. 1.5e-163 
0; Mismatches 143 



3 others 

Length 1560; 
Indels 48; Gaps 



3; 



Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 



15 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 74 

III MM II 1 1 1 1 1 1 1 1 1 1 1 MINIMUM! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Ml 

261 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 32 0 
75 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 134 

1 1 1 1 . 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 MMMII llllll 

321 CTGACGCCTAGCCGGCCGGCCGCAGGACTATCCACCTGCAAGACTATCGACATCGAGCTG 38 0 
13 5 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 194 

1 1 1 M M M II M M II 1 1 1 1 MM MMMIMM 1 1 1 1 1 1 1 1 1 1 1 Mill III 

381 GTGAAGCGGAAGCGCATCGAGACCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 44 0 
195 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 2 54 

! 1 1 II 1 1 1 M 1 1 1 1 1 1 1 1 1 1 II I II I II 1 1 1 1 II I M M Mill II II II II 

441 AG C C C CC CG AG C CAGGGGG AGGTG C CG C C CGG C C CG CTG C CCGAGG CCGTG CTCG C C CTG 5 00 



Qy 


255 


TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 

IIIIIMI 1 1 1 1 II 1 1 1 II 1 1 IIIIIIII MM II lllllllll 1 1 II III 

TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCGGAGCCGGAGCCCGAACCGGAG 


314 


Db 


501 


560 


Qy 


315 


GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 

II IIIIIIIMIIIIIMMIIIIIIIIIIMIIMIIIIIIIII 1 MM lllll 

GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 


374 


Db 


561 


620 


Qy 


375 


TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 

Mill II llllll II 1 llllll IIIIIMI 1 IIIIIIII II llllll 
TATGAC^AGTTCAAGCAGAGCACACACAGCATATATATGTTCTTCAACACATCAGAGCTC 


434 


Db 


621 


680 


Qy 


435 


CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 

II lllll II II IIIIIIII IIIIIIII Illllll llllll 1 M 1 1 1 1 1 t 1 1 1 
CGAGAAGCAGTACCTGAACCTGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 


494 


Db 


681 


740 


Qy 


495 


AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 

IMMMMM IIIIIIII IIIIIIII Mllllllll lllll IMMMMI 

AAGTTAAAAGTCGAGCAGCATGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 


554 


Db 


741 


800 


Qy 
Db 


555 
801 


TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 lllll MM MM MM MM lllllllll 

TACCTCAGCAACCGGCTGCTGGCGCCCAGCAACTCGCCGGAGTGGTTGTCTTTTGATGTC 


614 
860 


Qy 


615 


ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 III MM III II M Mill IIIIIIII II 

ACCGGAGTTGTGCGGCAGTGGTTGAGCCGCGGAGGGGAAATTGAGGGCTTTGGCCTTAGC 


674 


Db 


861 


920 


Qy 


675 


GCCC^CTGTTCCTGTGAC^GCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 

IIIIIIII 1 1 1 1 1 1 1 1 i E E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II Mill II IMMMMI 1 

GCCCACTGCTCCTGTGACAGC^W^GATAAC^C^CTGC^GTGGAC^TCAACGGGTTCACT 


734 


Db 


921 


980 


Qy 


735 


TCTGGCCGGCGGGGTGACGTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 

1 IIIIIIII 1 1 1 1 1 1 1 1 IE El Mill llllll MUM IIIIIIII III 

ACCGGCCGCCGAGGTGACCTGGCCACAATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 


794 


Db 


981 


1040 


Qy 


795 


ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGGGGAGCCCTG 

1 II 1 1 1 II M 1 1 M 1 1 1 II 1 1 1 1 1 1 1 II Mill llllllllllllllllll 
ATGGCCACCCCGGTGGAGAGGGCCCAACATCTGCAAAGCTCCCGGCACCGCCGA 


854 


Db 


1041 


1094 


Qy 


855 


GATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGC 

IIIIIIII 1 1 1 1 1 1 1 1 1 1 1 1 1 II Ml 

GCCCTGGACACCAACTACTGCTTCAGC 


914 


Db 


1095 


1121 


Qy 


915 


TCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGC 

1 1 1 M 1 1 1 1 1 1 1 II 1 II 1 1 llllllli M IMMMMM IIIIIIII Ml 

TCCACGGAGAAGAACTGCTNCGTGCGGCAGCTGTATATTGACTTCCGCAAGGACCTCGGC 


974 


Db 


1122 


1181 


Ov 


975 


TGGAAGTGGATTCATGAACCCAAGGGCTACCA.TGCrAATTTCTnrPTnnnnrrrTnTr'rr 

MM: II M M IMMI MM IMMIII II 1 1 1 1 1 1 1 II M 1 1 1 II II 1 

TGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTGGGGCCCTGTCCC 




Db 


1182 


1241 


Qy 
Db 


1035 
1242 


TACATCTGGAGCCTAGACACTCAGTACAGGAAGGT^ 

lllll IIIIIIII lllll IMIMIIIIIIIIII IIIIIIIIIIIMI III 

TACATTTGGAGCCTGGACACGC^GTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAAC 


1094 
1301 


Qy 


1095 


C CGGG CG CGTCGGCGG CGC CGTG CTG CGTG C CG CAGG CG CTGGAG C CACTG CC CAT CGTG 


1154 



MINIM M III Mill Ml ' II MM Mi MM INI I Mill III MM Mill 

Db 13 02 CCGGG CG C CTCGG CGG CG C CGTG CTG CGTG CCG CAGG CG CTGG AG C CA CTG C C CATCGTG 1361 

Qy 1155 TACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGC 1214 

1 1 1 1 1 1 1 M 1 1 1 1 1 1 ! 1 1 1 ! 1 1 1 1 1 ! 1 1 1 ! i 1 1 1 1 1 1 1 1 1 1 M 1 1 1 M I M I Mill 

Db 13 62 TACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGA 1421 

Qy 1215 AAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGCCCCAC 1274 

1 1 Ml II III I I I Mill Mill 

Db 1422 AAATGCAGCTGAGGCCCCGCCCCGCCCCGCCCCACCCCGGCAGGCCCGGCCCCGCCCCAC 1481 
Qy 12 75 CCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACA - TCGTGCCCCAAGCCCA 132 5 

Ml III M I 1 1 1 1 1 1 I ! I 1 1 1 I E E 1 1 1 1 I II I II 1 1 1 1 1 1 I M 

Db 14 82 CCCACCCCCGCTGTCTTGCCCTTGGGGGCTGTATTTAAGGACACCCGTGCCCCAAGCCCA 1541 

Qy 132 6 CTTGGGATCGATTAAA 1341 

I MM I I II 

Db 1542 CCTGGGGCCCCATTAA 1557 



Search completed: October 27, 2003, 18:35:10 
Job time : 5285.94 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



Searched : 



October 27, 2003, 10:47:27 ; Search time 406.484 Seconds 

(without alignments) 
8985.201 Million cell updates/sec 

US-10-017-372E-34 
1353 

1 gatctggtaccgagatggcg cgattaaagcggccgcgact 1353 

I DENT I T Y_NUC 

Gapop 10.0 , Gapext 1.0 

2552756 seqs , 1349719017 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



5105512 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database 



N_Geneseq_19Jun03 : * 
1 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA198 0 . DAT: * 
2 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1981 . DAT: * 
3 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1982 .DAT: * 
4 : / SIDSl/gcgdata/geneseq/ geneseqn-embl/NAl 983 . DAT : * 
5 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1984 .DAT: * 



6 
7 
8 
9 

10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 



/SIDSl/gcgdata/geneseq/geneseqn-embl/NA1985 . DAT: * 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA1986 . DAT: * 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA1987 . DAT: * 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA1988 . DAT: * 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA198 9 . DAT 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA1990 . DAT 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA1991 . DAT 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA1992 . DAT 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA1993 .DAT 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA1994 . DAT 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA1995 . DAT 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA1996 . DAT 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA1997 . DAT 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA1998 . DAT 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA1999 . DAT 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA2000 . DAT 
/ SIDSl/gcgdata/geneseq/geneseqn-embl/NA2001A. DAT 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA2001B.DAT 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA2002 . DAT: 
/SIDSl/gcgdata/geneseq/geneseqn-embl/NA2003 . DAT: 



★ 
★ 
* 
★ 

. ★ 
. ★ 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 
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1224 . 6 


90 


. 5 


1326 


24 


AAD22696 


Porcine transformi 


2 


994.8 


73 


. 5 


2527 


25 


ABQ76674 


Androgen receptor 


3 


994 


73 


5 


2537 


7 


AAN60972 


Sequence encoding 


4 


994 


73 


5 


2537 


11 


AAQ03301 


cDNA encoding huma 


5 


994 


73 


5 


2537 


11 


AAQ02814 


Sequence of pre-TG 


6 


994 


73 


5 


2537 


17 


AAT15720 


Pre -trans forming g 


7 


992 .4 


73 


3 


2537 


15 


AAQ56923 


Human pre-TGF-beta 


8 


989.2 


73 


1 


2537 


19 


AAV52933 


Human pre-transfor 


9 


986.2 


72 


9 


1561 


11 


AAQ03268 


Simian transformin 


10 


984 .6 


72 


8 


2742 


22 


AAI58342 


Human polynucleoti 


11 


983.8 


72 


7 


1559 


13 


AAQ2 02 8 9 


Sequence encoding 


12 


983 .4 


72 


7 


1821 


12 


AAQ13392 


Human pro-TGF-beta 


13 


982.8 


72 


6 


1560 


9 


AAN81084 


Coding sequence of 


14 


982.8 


72 


6 


1560 


11 


AAQ03508 


Simian Transformin 


15 


981 


72 


5 


2745 


16 


AAT05876 


cDNA encoding tran 


16 


981 


72 


5 


2745 


22 


AAH28216 


Nucleotide sequenc 


17 


973 


71 


9 


1303 


11 


AAQ09317 


Monkey transformin 


18 


972 


71 


8 


4105 


15 


AAQ55624 


TGFbetal 5 ' -UTR-CD 


19 


970 


71 


7 


1571 


11 


AAQ03269 


Human transforming 


20 


965 


71 


3 


1569 


9 


AAN81085 


Coding sequence of 


21 


961.8 


71 


1 


1569 


11 


AAQ03509 


Human Transforming 


22 


955.8 


70 


6 


1561 


11 


AAQ04908 


Sequence encoding 


23 


952.6 


70 


4 


1561 


13 


AAQ29177 


TGF-beta 1/beta 2 


24 


933.4 


69 


0 


1176 


25 


ABV75391 


TGFB1 Arg25Pro pol 


25 


931.8 


68 


9 


1176 


24 


ABZ35738 


Human TGF beta 1 p 


26 


931.8 


68 


9 


1176 


24 


ABX09981 


Human TGFbetal DNA 


27 


931.8 


68 


9 


1176 


24 


ABV78162 


Human TGF beta 1 D 
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nuiikiii lor DcLdi/ lu 


36 
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. 8 


57 


4 


Ct X f 
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X 


AANQP)7£A 


octjUciiLc ui nuniari 


37 


770 


. 6 


57 . 


. 0 


2200 


16 


AAT04115 


S i mi an- human hybri 


c 38 


700 


51 . 


. 7 


2773 


23 


AAS84421 


DNA encoding novel 


39 


6* 


12 


50. 


. 4 


834 


12 


AAQ12192 


Sequence encoding 


40 


657. 


.2 


48 . 


. 6 


1376 


24 


ABK90341 


DNA encoding LAP-m 


41 


655. 


.6 


48 . 


, 5 


1389 


24 


ABK90344 


DNA encoding LAP-h 


42 


591. 


.4 


43. 


, 7 


1352 


24 


ABK90342 


DNA encoding mIFNB 


43 


589. 


. 8 


43 . 


, 6 


1350 


24 


ABK90343 


DNA encoding huIFN 


44 


538 


39. 


. 8 


875 


23 


AAS70979 


DNA encoding novel 


45 


366 , 


, 4 


27. 


. 1 


489 


24 


ABL99528 


Target canine gene 



ALIGNMENTS 



RESULT 1 
AAD22696 

ID AAD22696 standard; CDNA; 1326 BP. 
XX 

AC AAD22696; 
XX 

DT 26-FEB-2002 (first entry) 
XX 

DE Porcine transforming growth factor beta 1 (TGF-betal) cDNA. 
XX 

KW Porcine; transforming growth factor beta 1; TGF-betal; gene therapy; 
KW IBD; inflammatory bowel disease; autoimmune disease; immunosuppressive; 
KW multiple sclerosis; rheumatoid arthritis; systemic lupus erythematosus; 
KW diabetes mellitus; sarcoidosis; psoriasis; dermatological ; ss. 
XX 

OS Sus scrofa. 
XX 

FH Key Location/Qualifiers 
FT CDS 16. . 1188 

FT /*tag= a 

FT /product^ "Porcine TGF-betal mutant protein" 

XX 

PN WO200181404-A2 . 
XX 

PD 01-NOV-2001. 
XX 

PF 20-APR-2001; 2 0 0 1WO-US12 98 0 . 
XX 

PR 20-APR-2000; 2 000US- 199014 P . 
XX 

PA (USSH ) US DEPT HEALTH & HUMAN SERVICES. 
XX 

PI Strober W, Nakamura K, Kitani A, Fuss IJ; 
XX 



DR WPI; 2002-026155/03. 

DR P-PSDB; AAE13596. 
XX 

PT Composition for treating autoimmune diseases e.g. inflammatory bowel 

PT disease in humans, comprises vector containing transforming growth 

PT factor-beta under the control of inducible promoter 
XX 

PS Claim 1; Fig 1; 78pp ; English. 
XX 

CC The invention relates to a composition containing a vector comprising a 

CC gene encoding a regulatory transcription factor under the control of a 

CC promoter encoding a transforming growth factor-beta (TGF-beta) . The 

CC vector is useful for expressing TGF-beta, such as TGF-betal, TGF-beta2 

CC or TGF-beta3, its variants or homologues, by transfecting a cell which 

CC is part of a host suspected of having an autoimmune disease, especially 

CC inflammatory bowel disease (IBD) , under conditions such that the 

CC polypeptide encoded by the nucleic acid sequence in the vector is 

CC expressed. The vector is delivered using a delivery system. The delivery 

CC of the vector results in substantial elimination of symptoms of the 

CC autoimmune disease and increased production of IL-10 by the host. The 

CC composition is useful for treating various diseases with an autoimmune 

CC component such as multiple sclerosis, rheumatoid arthritis, systemic 

CC lupus erythematosus, insulin-dependent diabetes mellitus, sarcoidosis 

CC and psoriasis, and also for assaying the expression of a gene in a cell. 

CC The vector is further useful for screening of the effect of test 

CC compounds on cytokine (e.g. TGF-beta) expression of transfected cells. 

CC The present sequence is a cDNA encoding porcine TGF-betal mutant. 

XX 

SQ Sequence 1326 BP; 263 A; 438 C; 392 G; 233 T; 0 other; 

Query Match 90.5%; Score 1224.6; DB 24; Length 1326; 

Best Local Similarity 96.3%; Pred. No. 3.7e-241; 

Matches 1301; Conservative 0; Mismatches 9; Indels 41; Gaps 3; 
Qy 1 GATCTGGTACCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGC 60 

1 1 1 1 1 : 1 1 1 1 1 M 1 1 il I ! I MM 1 1 1 1 1 1 1 1 1 1 1 1 II I II I II 1 1 II 1 1 1 ! II i 1 1 1 1 

Db 2 GATCTGGTACCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGC 61 

Qy 61 TGTGGCTGCTAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCA 12 0 

II I il I II I M 1 1 M Ml 1 1 III IM I M 1 1 M M 1 1 III Ml III I Ml I M M M 1 1 1 

Db 62 TGTGGCTGCTAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCA 121 

Qy 121 TCGACATGGAGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCA 18 0 

I M M 1 1 1 1 M 1 1 M I II 1 1 1 II M MM 1 1 1 1 M 1 1 II I Ml III I MM I MMI 1 1 1 M 

Db 122 TCGACATGGAGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCA 181 

Qy 181 AGCTTCGGCTTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGG 24 0 

I III II II 1 1 MM II III M I M II M I II I II 1 1 M IM 1 1 M I MMI M 1 1 1 

Db 182 AGCTTCGGCTCGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGG 241 

Qy 241 CAGTACTGGCTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGG 3 00 

I MM MM' IMI I II IMIMMMMMIIM IMiM 1 IMIMMIM 

Db 242 CCGTACTGGCTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGG 3 01 

Qy 3 01 AGCCCGAGCCAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAA 360 

M 1 1 1 1 ! M 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 

Db 3 02 AGCCCGAGCCAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAA 3 61 



Qy 


361 


Db 


362 


Qy 


421 


Db 


422 


Qy 


481 


Db 


482 


Qy 


541 


Db 


542 


Qy 


601 


Db 


602 


Qy 


661 


Db 


662 


Qy 


721 


Db 


722 


Qy 


781 


Db 


782 


Qy 


841 


Db 


842 


Qy 


901 


Db 


863 


Qy 


961 


Db 


923 


Qy 


1021 


Db 


983 


Qy 


1081 


Db 


1043 


Qy 


1141 


Db 


1103 



GCGGCAAC(^U\ATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCA 42 0 

I ll I il 1 . 1 1 . 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 II M 1 1 1' 1 1 1 1 II I; I 

GCGGC^CC^^TCTATGATAAATTCAAGGGCACCCCCC^CAGCTTATATATGCTGTTCA 421 
ACACGTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGC 4 8 0 

II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 M I! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 , 1 Ml 1 1 1 1 1 1 1 II 1 1: 1 1 1 1 1 1 

ACACGTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGC 4 81 

GCCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCA 54 0 

I I I I I I I I I II I I I I I I I Ml I I I I I I I I I ; I I I I I I I I I I I I I I I I II I II I I I I M 
GCCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCA 541 

ATGATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGC 6 00 

1 1 1 II II I II I II I II II M 1 1 M II II I II I II I M III M M I M M I M M II 1 1 

ATGATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGC 601 
TGTCCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGG 660 

1 1 1 i ) 1 1 1 1 e 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TGTCCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGG 661 
GTTTTCGCCTCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAA 72 0 

Illllllllllllllllllll Mill MIIIIIIIIIIIIMMIMI II MIMI 

GTTTTCGCCTCAGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAA 721 
TTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGC 78 0 

II 1 1 1 i I i 1 1 i 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 j j 1 1 1 . 1 1 1 j 1 1 1 1 1 1 1 i 1 1 1 1 1 : j 

TTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGC 781 
CCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGC 84 0 

I II Ml I II M III M II 1 1 1 1 1 I MMI II I II I M II 1 1 IM III M I M I II 1 1 1 

CCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGC 841 

ACCGCCGAGCCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCA 90 0 
IIIIMM II 
ACCGCCGA GCCCTGGATACCA 8 62 

ACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCC 960 

1 1 M 1 1 1 1 1 1 1 1 ! M I M I M M I M I II M IM ! M I M 1 1 M 1 1 1 1 M I II I M M 

ACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCC 922 

GGAAGGAC CTGGG CTGGAAGTGGATT CATGAAC C CAAGGG CTA C CATG C CAATTT CTG CC 102 0 

I I I I I ' ' I I 

GGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCC 982 

TGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGT 108 0 

I Ml I M I II MM I M II M MM I MMI I II II I M I III I II I II 1 1 M M I II III 

TGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGT 104 2 

ACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGC 114 0 

M I I I I I I I I . I I I I : I I ! I I I I I I I I ! I I 1 I I .' I I I I ' I 1 I I I I I I I I M I I I I I ' I 
ACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGC 1102 

CACTGCCC^TCGTGTACTACGTGGGCCGO^GCCCAAGGTGGAGCAGCTGTCCAACATGA 12 00 

. , ' , I ' I I I i: 

C^CTGCCCATCGTGTACTACGTGGGCCGCAAGCCO^GGTGGAGCAGCTGTCCAACATGA 1162 



Qy 12 01 TCGTGCGTTCCTG(^GTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGC 12 6 0 

I II I I I I I M I I M I I I I I I I I I I I I I I I I I I I I ,111111 : I - i 

Db 1163 TCGTGCGTTCCTGCAAGTGCAGCTGA-GCCCCGCCCCGCCCACAGCCCCGCCCACCCGGC 1221 

Qy 1261 AGGCCCGGCCCCACCCCCGCCCGCCTCACCGGGGCTGTATTTAAGGACATCGTGCCCCAA 1320 

IIIIIIMIIIIIIMIIII lllllllllllllllllll ll'llllhll Illlll 

Db 1222 AGGCCCGGCCCCACCCCCGCCCGCCTCACCGGGGCTGTATTTAAGGACATCGTGCCCCAA 1281 

Qy 1321 G C CCA CTTGGG AT CG ATTAAAG CGG C CG CGA 13 51 

Illlll lllllllllllllll I I II 
Db 1282 GCCCAC- TGGGATCGATTAAAGGTGGAGAGA 1311 

RESULT 2 
ABQ76674 



ID ABQ76674 standard; DNA; 2527 BP. 
XX 

AC ABQ76674; 
XX 

DT 26-MAR-2003 (first entry) 
XX 

DE Androgen receptor signalling pathway-associated DNA E00973 . 
XX 

KW Androgen receptor; transact ivation; modulator; Smad3 ; Smad4 ; Akt; TGF-B; 
KW signal transduction pathway; transforming growth factor-B; phosphatase; 
KW tensin; cytostatic; antiproliferative; cellular proliferation; cancer; 
KW E00973; ds . 
XX 

OS Synthetic. 
XX 

PN WO200282081-A2 . 
XX 

PD 17-OCT-2002. 
XX 

PF 05-APR-2002; 2002WO-US11086 . 
XX 

PR 06-APR-2001; 2 0 0 1US- 282266P . 
PR 13-MAR-2002; 2 0 02US- 3 65 060P . 
XX 

PA (UYRP ) UNIV ROCHESTER. 
XX 

PI Chang C; 
XX 

DR WPI; 2003-046871/04. 
XX 

PT Modulating androgen receptor activity, by administering a compound that 

PT modulates receptor activity, inhibits receptor-signal transduction 

PT pathway/receptor-coactivator interaction or changes amount or receptor 

PT - 

XX 

PS Disclosure; Page 225-226; 302pp ; English. 
XX 

CC This invention describes a novel method for modulating androgen receptor 

CC activity or androgen receptor-mediated transact ivation activity in a 

CC cell. The method involves administering a compound which causes 

CC modulation of the androgen receptors activity and the inhibition of 

CC interaction between the receptor and a protein involved in a signal 



CC transduction pathway. The compound also inhibits the interaction between 

CC the androgen receptor and a protein selected from Smad3 , Smad4 , Akt, 

CC transforming growth factor (TGF) -B and phosphatase and tensin homologues 

CC deleted on chromosome 10 (PTEN) or their fragments. The compounds of the 

CC invention have cytostatic and antiproliferative activity. The obtained 

CC composition is useful for treating any disease, where uncontrolled 

CC proliferation or cellular proliferation occurs such as cancer, e.g. 

CC prostate cancer. This sequence represents the androgen receptor 

CC transactivation signalling pathway modulator E00973 described in 

CC the method of the invention. 

XX 

SQ Sequence 2527 BP; 472 A; 888 C; 735 G; 432 T; 0 other; 

Query Match 73.5%; Score 994.8; DB 25; Length 2527; 

Best Local Similarity 85.9%; Pred . No. 4e-194; 

Matches 1144; Conservative 0; Mismatches 147; Indels 41; Gaps 2; 
Qy 10 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 6 9 

II III 1 1 1 1 II MINI.. MM, Mill MINIUM I 

Db 837 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 8 96 

Qy 7 0 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 9 

I INNNIMMMM MINIMI Mill 1 1 1 1 1 1 1 1 M 1 1 1 1 MINIUM 

Db 8 97 TGGTGCTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 956 

Qy 13 0 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 9 

1 1 1 1 1 1 1 1 1 1 1 1 ] 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 J 1 1 1 1 1 1 1 1 1 1 1 i I 1 1 1 1 1 1 1 1 1 1 1 mi 

Db 957 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 1016 

Qy 190 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 9 

I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 M 1 1 MM I M Mill II II I 

Db 1017 TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 1076 

Qy 2 50 CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 09 

I II 1 1 1 1 1 1 1 1 INNNNINN 1 1 1 1 [ I E I MM MNNNIIIIMM 

Db 1077 CCCTCTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 1136 

Qy 310 CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 3 69 

I INN 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 I INI 

Db 1137 CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 1196 

Qy 3 70 AAATCTATGATAAATTCAAGGG(^CCCCCC^(^GCTTATATATGCTGTTa^ACACGTCGG 4 29 

1 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 1 i i www 1 1 1 r r 1 1 1 i 1 mm ii i 

Db 1197 AAA T CTATGACAAGTTCAAG CAG AGTA CA CA CAG CATATATATGTT CTT CAACACAT CAG 1256 

Qy 43 0 AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 8 9 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II INN II MINIM 1 1 1 1 1 i 1 1 1 1 1 [ I [ lllllll 

Db 12 57 AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 1316 

Qy 4 90 GGCTC^GTTAAAAGTGGAGC^GCA.CGTGGAGCTATACCAGAAATACAGa^ATGATTCCT 54 9 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I f I I I I I I I 
Db 1317 GGCTCAAGTTAAAAGTGGAGCAGCA.CGTGGAGCTGT^ 1376 

Qy 55 0 GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 6 09 

INI IIIIIIIIIIIINIIIIINII INN INN II MM I II II 1 1 

Db 13 77 GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 143 6 



Qy 


610 


ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 

1 1 1 Ml ! 1 . 1 1 1 1 1 1 1 1 1 II 1 1 Ml Ml Ml III II II Mill lllllll 

ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 


669 


Db 


1437 


1496 


Qy 


670 


TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 

1 II MMMM Mill MM ! lllllllllll M MIM II lllllll 

TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGT 


729 


Db 


1497 


1556 


Qy 


730 


TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 


789 


Db 


1557 


Ml 1 1 MMMM IMMMMMMIMMM MIIMMMMM MMMI 

TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 


1616 


Qy 


790 


TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 

1 1 1 MM II M M M M 1 Ml MM MM 1 1 M Mill 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 


849 


Db 


1617 


1675 


Qy 


850 


CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 


909 


Db 


1676 


MMMM MMMM MM 

GCCCTGGACACCAACTATTGCT 


1697 


Qy 


910 


TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 

MMMM MM Ml M M MIM MMMM MM M ! ! MIIMMMMM MMMI 

TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACC 


969 


Db 


1698 


1757 


Qy 


970 


TGGGCTGGAAGTGGATTCATGAACCC^AGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 

1 llllllllllllll M II Mill IMMMMIMM MMMM lllllll 

TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCT 


1029 


Db 


1758 


1817 


Qy 


1030 


GTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 

1 MMMM llllllll Mill Ml 1 II 1 II 1 1 1 Ml 1 IM IMMIMIIMI 

GCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGC 


1089 


Db 


1818 


1877 


Qy 


1090 


A CAA C CCGGG CG CGTCGG CGG CG CCGTG CTG CGTG C CG CAGG CG CTGGAG CCA CTGC CCA 

1 lllllllllll M II MM 1 M II II 1 II 1 MM 1 1 1 MM 1 1 1 II MM MMMI 

ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCA 


1149 


Db 


1878 


1937 


Qy 


1150 


TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 

MIM IMIMMMMMM II II MMi; II II 1 1 II II 1 II 1 1 1 1 1 1 1 1 1 1 

TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGGAGCTGTCCAACATGATCGTGCGCT 


1209 


Db 


1938 


1997 


Qy 


1210 


CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 

1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 M llllllll Ml 1 lllllllil M III 

CCTGCAAGTGCAGCTGAGGTCCCGCCCCGCC - - CCGCCCCGCCCGGCCCCACCCCGCCCC 


1269 


DD 


1998 


2055 


Qy 


1270 


CCCACCCCCGCCCGCCTCACCGGGGCTGTATTTAAGGACATCGTGCCCCAAGCCCACTTG 


1329 


Db 


2056 


II M 1 III 1 M MIMIIMIIIIIIIMI MIMMMMIMM II 

GCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCGTGCCCCAAGCCCACCTG 


2115 


Qy 


1330 


GGATCGATTAAA 1341 

Ml III 

GGGCCCCATTAA 2127 




Db 


2116 





RESULT 3 
AAN60972 

ID AAN60972 standard; cDNA; 2537 BP. 
XX 





AAN60972; 




AA 






U 1 


31-OCT-2002 


(updated) 


DT 


28-OCT-1991 


(first entry) 


YY 






DE 


Sequence encoding preTGF-beta. 


XX 








Transforming 


growth factor beta; cancer; wound healing. 


XX 






OS 


Unidentified 




XX 






r n 


Key 


Location/Qualifiers 


RT 


misc_structure 37 . . 113 


r 1 




/*tag= a 


r i 




/note= "Sequence can form stable hairpin loops" 


FT 
r i 


CDS 


842.. 2014 


FT 




/*tag= b 


FT 
r i 


mat_peptide 


1676. .2011 


FT 




/*tag= c 








PM 


EP200341-A. 




XX 






pn 

ir U 


10-DEC-1986. 




XX 






PF 


21-MAR-1986; 


86EP-0302112 . 


yy 

AA 






DD 

tr rv 


22-MAR-1985; 


85US-0715142 . 


PR 


13-MAR-1987; 


87US-0025423 . 


yy 

AA 






PA 


(GETH ) GENENTECH INC. 


yy 

AA 






PI 


Derynck RMA; 




yy 

AA 






DR 


WPI; 1986-326875/50. 


DR 


P-PSDB; AAP61468. 


XX 






PT 


TGF-beta prodn. from transformed hosts - useful esp. for treating 


PT 
r 1 


wounds (J6 2/9/86) . 


y y 

AA 






DC 


Disclosure; 


Fig lb; 26pp; English. 


XX 






cc 


The gene product is known to stimulate cell proliferation and 


cc 


inhibit anchorage-dependent growth of a variety of human cancer eel 


cc 


lines, it is 


esp. useful in treatment of burns and the promotion of 


cc 


surface and 


internal wound healing. TGF-beta may be expressed from « 


cc 


transformed 


CHO cell line. 


cc 


(Updated on 


31-OCT-2002 to add missing OS field.) 


XX 






SQ 


Sequence 2537 BP; 473 A; 893 C; 739 G; 432 T; 0 other; 



Query Match 73.5%; Score 994; DB 7; Length 2 53 7; 

Best Local Similarity 85.7% ; Pred . No. 5.8e-194; 

Matches 1148; Conservative 0; Mismatches 145; Indels 47; Gaps 2; 
Qy 10 CCGAGATGG CG C CTTCGGGG CTG CGG CT CTTG C CG CTG CTG CTGCCG CTG CTGTGG CTG C 69 

M III 1 1 1 1 II I F 1 . 1 1 1 1 1 1 1 lllllllllllll .1 II I MINIM I 

Db 837 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 8 96 



Qy 


70 


Db 


897 


Qy 


130 


Db 


957 


Qy 


190 


Db 


1017 


Qy 


250 


Db 


1077 


Qy 


310 


Db 


1137 


Qy 


370 


Db 


1197 


Qy 


430 


Db 


1257 


Qy 


490 


Db 


1317 


Qy 


550 


Db 


1377 


Qy 


610 


Db 


1437 


Qy 


670 


Db 


1497 


Qy 


730 


Db 


1557 


Qy 


790 


Db 


1617 


Qy 


850 


Db 


1676 



llllllllllllllll lllllllll Mill llllilllllllll llllllllll 



AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 9 

i m 1 1 r i ii i ii 1 1 1 1 ii 1 1 1 1 1 ii 1 1; iiiiiiiiiii iiiiiiiiiii 1 1 1 1 

AG CTGGTG AAG CGGAAG CG CAT CGAGG C CAT C CG CGG C CAGAT CCTGTCCAAG CTG CGG C 1016 
TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 249 

I I I MM IIIIIIIIMM II II 1 1 1 1 1 1 1 1 1 1 1 1 1 II. Mill II II I 

TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 1076 



I MIMIII llllllllllllll llllllll MM IIIIIIIIIIIII 



CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 3 69 

I Mill I II 1 1 1 1 1 1 1 II I M 1 1 II I II II 1 1 1 M 1 1 1 1 1 II 1 1 II 1 1 I MM 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 1196 

AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATAT^ 4 29 

llllllllll II MUM I I MUM llllllll I llllllll II I 

AAATCTATGACAAGTTCAAGG&GAGTACACACA 1256 

AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 8 9 

lllllll llllllll II Mill II llllllll llllllllllllll lllllll 

AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 1316 

GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 9 

I llllllll Ml Ml III MINI MM IIIM 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 MINI 

GGCT(^GTTAAAAGTGGAG(^G(^CGTGGAGCTGTACC^GAAATAC^GC^(^ATTCCT 1376 

GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 609 

MM . 1 . 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 II I Mill IIIM II llllll I II MM 

GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 143 6 

ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 669 

1 1 . 1 1 1 i I 1 1 1 1 r 1 1 1 1 1 1 1 1 : 1 ! I Ml Ml Ml II II Mill lllllll 

ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 14 96 

TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 729 

I M IIIIIMI IMIIIIIMIII IIIMMIIII M Mill II lllllll 

TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAAC^CA 155 6 

TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 78 9 

III I I llllllll 1 1 1 1 1 M I MM III 1 1 1 1 1 llllllllllllll lllllll 

TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 1616 

TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 9 

I 1 1 1 1 1 1 Mill I M 1 1 1 1 1 III II I ! II II I Mill 1 1 II 1 1 II M I M 1 1 1 1 1 

TTCTOVTGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 1675 

CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 909 

IIIIIMI MIMIII MM 

GCCCTGGACACCAACTATTGCT 1697 



Qy 910 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 969 

1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 III II 1 1 1 1 M 1 1 1 MINIMUM Ml I 

Db 16 98 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACC 17 57 

Qy 97 0 TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 1029 

i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii ii 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 e 1 1 1 1 1 1 1 1 1 1 1 1 mm; 

Db 1758 TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCT 1817 

Qy 103 0 GTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 108 9 

I MINIM MM INN 1 1 1 1 1 1 1 M M I M 1 1 1 1 1 1 MMM MM I 

Db 1818 GCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGC 1877 

Qy 10 90 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 9 

I 1 1 1 1 1 1 1 1 1 1 1 I II 1 1 1 1 1 II I II II 1 1 1 1 1 II I Ml II M Ml 1 1 MIIMI 

Db 1878 ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCA 1937 

Qy 1150 TGGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 12 09 

1 1 1 1 m 1 1 e 1 1 1 1 ft 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 i 

Db 1938 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCT 1997 

Qy 1210 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 12 69 

1 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 1 MMM II II I II I I I INN I 

Db 19 98 CCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCCCAC 2 057 
Qy 127 0 CCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACATCGTGCCCCAAG 1321 

III NN III II II III MINIMUM II 1 1 1 1 1 J 1 1 1 : 1 

Db 2 058 CCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCGTGCCCCAAG 2117 

Qy 1322 CCCACTTGGGATCGATTAAA 1341 

INN INI I I II 
Db 2118 CCCACCTGGGGCCCCATTAA 2137 

RESULT 4 
AAQ03301 

ID AAQ03301 standard; DNA; 2537 BP. 
XX 

AC AAQ03301; 
XX 

DT 25-MAR-2003 (updated) 

DT 05-AUG-1990 (first entry) 

XX 

DE cDNA encoding human pre-transf orming growth f actor-beta-1 (pre-TGF-beta- 

DE 1). 

XX 

KW Transforming growth factor-beta-1 (TGF-beta-1) ; 

KW neoplastic cell line inhibition; 

KW EGF-potentiated anchorage- independent growth; 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT CDS 842 . .2014 

FT /*tag= a 

FT mat__peptide 1676.. 2011 

FT /*tag= b 

FT misc difference 37.. 113 



FT /*tag= c 

FT /note="stable hairpin loops'* 

FT misc_f eature 2015.. 2100 

FT /*tag= d 

FT /note="G-C rich sequence 

FT and a downstream TATA- 1 ike sequence" 

XX 

PN US4886747-A. 
XX 

PD 12-DEC-1989. 
XX 

PF 13-MAR-1987; 87US- 0025423 . 
XX 

PR 13-MAR-1987; 87US- 0025423 . 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Derynck RMA, Goeddel DV; 
XX 

DR WPI; 1990-051338/07. 

DR P-PSDB; AAR05258. 
XX 

PT Nucleic acid encoding transforming growth factor-beta - 

PT cloned into expression vectors for expression in eukaryotic host 

PT cells for therapeutic use 

XX 

PS Disclosure; Fig lb; 28pp; English. 
XX 

CC It was obtained by an analysis of several overlapping cDNAs and gene 

CC fragments, leading to the detn . of a continuous sequence corresp . to the 

CC TGF-beta-1 precursor mRNA. It is useful in constructing vectors that 

CC encode biologically active transforming growth factor (TGF-beta) , 

CC operably linked to DNA that encodes a secretory leader (SL) . It, or a 

CC nucleic acid capable of hybridising with it, can also be labelled and 

CC used in diagnostic assays for DNA or mRNA encoding TGF-beta or related 

CC proteins. 

CC (Updated on 25-MAR-2003 to correct PF field.) 
XX 

SQ Sequence 2537 BP; 473 A; 893 C; 739 G; 432 T; 0 other; 

Query Match 73.5%; Score 994; DB 11; Length 2537; 

Best Local Similarity 85.7%; Pred. No. 5.8e-194; 

Matches 1148; Conservative 0; Mismatches 145; Indels 47; Gaps 2; 

Qy 10 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 69 

II Ml I I I I II lllllllllil lllllllllllll llllllllllllll I 
Db 837 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 8 96 

Qy 70 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 9 

I Mllllllllllllll IIIIIIIM Mill 1 1 F 1 1 1 1 1 1 1 1 1 1 1 llllllllll 

Db 8 97 TGGTGCTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 956 

Qy 13 0 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 9 

IMIIIIIIIII llllll IMIIIIIIIMI lllllllllil llllllllll 1 1 II 

Db 957 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 1016 



Qy 



190 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 9 



Db 


1017 


Qy 


250 


Db 


1077 


Qy 


310 


Db 


1137 


Qy 


370 


Db 


1197 


Qy 


430 


Db 


1257 


Qy 


490 


Db 


1317 


Qy 


550 


Db 


1377 


Qy 


610 


Db 


1437 


Qy 


670 


Db 


1497 


Qy 


730 


Db 


1557 


Qy 


790 


Db 


1617 


Qy 


850 


Db 


1676 


Qy 


910 


Db 


1698 


Qy 


970 


Db 


1758 


Qy 


1030 



I IMIIMIIMIIIIIIMII 1 1 1 II 1 1 ! I M III I II 1 1 1 Mill II II I 

TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 1076 
CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 09 

I II llllllll llllllllllllll IMMIII MM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 1136 
CAG AGG CGGACTACTA CG C CAAGGAGGT CAC C CG CGTG CTAATGGTGGAAAG CGG CAAC C 369 

I Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I MM 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 1196 
AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 42 9 

Illlllllll II MMM I I MMM llllllll I IMMIII II I 

AAATCTATGACAAGTTCAAGC^GAGTAC^CACAGCATATATATGTTCTTCAACACATCAG 12 56 

AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 8 9 
lllllll llllllll II Mill II llllllll M I I I I I I ; I I I I lllllll 

AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 1316 
GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 9 

III !l I < 1 1 II 1 1 1 II II I II 1 II 1 1 1 1 1 1 1 1 M 1 1 Ml III I M 1 1 1 1 1 1 MMM 

GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCT 13 7 6 



MM MMMMMMMMIMMM Mill Mill II MMM I II II 



ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 669 

II I II I M I 1 1 1 1 II II I II I II I Ml Ml Ml II M Mill lllllll 

ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 14 96 

TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 72 9 

I II llllllll lllllllllllll I! MM, !il II Mill II lllllll 

TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGT 1556 

TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 789 

III I I llllllll IMMMMMMMIMM llllllllllllll lllllll 

TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 1616 

TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 9 

I Ml Ml II llllllll III III III III I Mill IIIMIIIIIIMIIIII 

TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 1675 



llllllll llllllll MM 
-GCCCTGGACACCAACTATTGCT 1697 



I II II II II II I II II I I I M I I I I I I II II llllllllllllll lllllll 



Illlllllll 



I II Mill Mill II lllllll MIIIMI lllllll 



I llllllll IMMIII Mill II lllllll MIIMIMM lllllllllllll 



Db 



1818 GCCCCTAC^TTTGGAGCCTGGAC^CGCAGTAC^GC^GGTCCTGGCCCTGTACAACCAGC 1877 



Qy 1090 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 9 

I IIIIIIIIMI I Mill IIIIIIIIIIIIIIIIMIIIIIIIIII I lllllll 
Db 18 78 ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCA 1937 

Qy 1150 TCGTGTACTACGTGGGGCGGAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 1209 

1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ] 1 1 1 1 1 1 1 1 I 

Db 193 8 TCGTGTACTAGGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCT 1997 

Qy 1210 CCTGCAAGTGCAGCTGAGGCCCCGCCGCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 1269 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E I 1 1 1 1 1 1 1 1 1 1 1 1 II I II I I I Mill I 

Db 1998 CCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCGGCCCCGGCAGGCCCGGCCCCAC 2 057 
Qy 1270 CCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACATCGTGCCCCAAG 1321 

III MM II! II II 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 2 0 58 CCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCGTGCCCCAAG 2117 

Qy 1322 CCCACTTGGGATCGATTAAA 1341 

MMI MM I I II 

Db 2118 CCCACCTGGGGCCCCATTAA 2137 



RESULT 5 
AAQ02814 



ID 
XX 
AC 
XX 
DT 
DT 
DT 
XX 
DE 
XX 
KW 
KW 
XX 
OS 
XX 
FH 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 



AAQ02814 standard; cDNA; 2537 BP. 



AAQ02814; 

25-MAR-2003 
31-OCT-2002 
31-MAY-1989 



(updated) 
(updated) 
(first entry) 



Sequence of pre-TGF-betal cDNA. 

Transforming growth factor beta-3 
inhibition . 



(TGF beta 3) ; tumour cells; growth 



Homo sapiens. 

Key 
CDS 



CDS 



GC_signal 



misc feature 



stem_loop 



misc feature 



Location/Qualifiers 
842. .2011 
/*tag= a 

/label=pre-TGF beta 1 
1677 . .2011 
/*tag= b 

/label =mature TGF-beta 1 
2015. .2092 
/*tag= c 
2093 . .2099 
/*tag= d 

/label=TATA-like sequence 
37. .113 
/*tag= e 
863 . . 911 
/*tag= f 

/label=hydrophobic domain 



XX 

PN WO8912101-A. 
XX 

PD 14-DEC-1989. 
XX 

PF 08-JUN-1988; 88WO-US0 1 945 . 
XX 

PR 08-JUN-1988; 88WO-US01945 . 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Dernyck RMA, Goeddel DV; 
XX 

DR WPI; 1990-007474/01. 

DR P-PSDB; AAR04034. 
XX 

PT Nucleotide sequence encoding transforming growth factor beta-3 -used as a 

PT probe, or to produce TGF beta 3, for growth inhibition of certain normal 

PT and neoplastic cells, eg A549. 
XX 

PS Disclosure; Fig. lb; 61pp; English. 
XX 

CC Sequence encodes the 390 amino acid (AA) precursor transforming growth 

CC factor-beta 1 (pre-TGF-beta 1) polypeptide. The 5' untranslated region of 

CC the TGF-beta 1 mRNA is 841 bases long, is purine rich and has a region of 

CC potential secondary structure. The TATA- 1 ike sequence in the 3* untrans- 

CC lated region of the gene is presumably a polyadenylat ion signal. Mature 

CC TGF-beta 1 comprises the C-terminal 112 AA's of pre-TGF-beta 1 and is 

CC cleaved at the Arg-Arg dipeptide preceding its NH2 terminus. The nucleic 

CC acid encoding the second subtype of TGF-beta (TGF-beta 3) is useful as a 

CC probe or to produce TGF-beta 3 for inhibition of growth of normal and 

CC neoplastic cells. 

CC (Updated on 31-OCT-2002 to add missing OS field.) 

CC (Updated on 25-MAR-2003 to correct PR field.) 

CC (Updated on 25-MAR-2003 to correct PI field.) 
XX 

SQ Sequence 2537 BP; 473 A; 893 C; 739 G; 432 T; 0 other; 

Query Match 73.5%; Score 994; DB 11; Length 2537; 

Best Local Similarity 85.7%; Pred. No. 5.8e-194; 

Matches 1148; Conservative 0; Mismatches 145; Indels 47; Gaps 2; 
Qy 10 CCGAGATGG CG CCTTCGGGG CTG CGG CT CTTG CCG CTG CTG CTG CCG CTG CTG TGG CTG C 69 

II III 1 1 1 1 II MIMIMIII lllllllllllll MINIM MM I 

Db 837 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 8 96 

Qy 70 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 9 

I llllllllllllllll MINIMI Mill lillllllllllll IIMIIIIII 

Db 8 97 TGGTGCTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 956 

Qy 13 0 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 9 

I I I : I I I ' I I I I ' ' I lllllllllll lllllllllll INI 

Db 957 AG CTGGTGAAG CGGAAG CG CAT CGAGG C CATCCG CGG C CAGAT C CTGTC CAAG CTG CGGC 1016 

Qy 190 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 9 

I MM II I II II I IIIIIIIIIIMIIIMIM Mill II II I 

Db 1017 TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 1076 



Qy 


250 


Db 


1077 


Qy 


310 


Db 


1137 


Qy 


370 


Db 


1197 


Qy 


430 


Db 


1257 


Qy 


490 


Db 


1317 


Qy 


550 


Db 


1377 


Qy 


610 


Db 


1437 


Qy 


670 


Db 


1497 


Qy 


730 


Db 


1557 


Qy 


790 


Db 


1617 


Qy 


850 


Db 


1676 


Qy 


910 


Db 


1698 


Qy 


970 


Db 


1758 


Qy 


1030 


Db 


1818 



CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 0 9 

I II llllllll llllllllllllll llllllll I I I I lllllllllllll 
CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 113 6 

CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 369 

I Mill MMI M II MM III I MM II I M I Mill MM I INI I 1 1 1 1 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 1196 
AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 42 9 

MIM MM II MMM I I MMM llllllll I llllllll II I 

AAATCTATGACAAGTTCAAGCAGAGTACACACAGC^^ 12 56 

AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 8 9 

lllllll llllllll II Mill II llllllll llllllllllllll lllllll 
AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 1316 

GG CTCAAGTTAAAAGTGGAG CAG CA CGTGGAG CT ATA C CAGAAATA CAG CAATG ATT C CT 54 9 

III Mill IMIMIII III IMMIIMM II I IIIIIIMIIIIIIIII 1 1 1 1 1 1 

GG CTCAAGTTAAAAGTGGAG CAG CA CGTGGAG CTGTAC CAGAAATA CAG GAACAATTCGT 1376 
GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 609 

MM MMMMMMMMMMMI Mill MIM II MMM I II MM 

GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 14 3 6 
ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 669 

MMM II Illlllllllllllll Ml Ml Ml II M MMI lllllll 

ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 14 96 

TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 72 9 

I II llllllll lllllllllllll lllllllllll II Mill II lllllll 
TTAGCGCC(^CTGCTCCTGTGA(^GCAGGGATAACA(^CTG(^GTGGACATCAACGGGT 15 56 

TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 78 9 

III I I llllllll I ! I I I I . I I I I I I I I I I I I llllllllllllll lllllll 
TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 1616 

TCCTCATGGCC^CCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 9 

I M MMMM MMM I I MMM Mill I I 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 

TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 1675 
CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 909 

llllllll llllllll MM 

GCCCTGGACACCAACTATTGCT 1697 

TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 96 9 

lllllllllllllllll llllll IIIMIII III III llllllllllllll lllllll 

TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACC 1757 
TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 1029 

I 1 1 1 1 1 1 II II MM II II lllllillllllllllllll MMMM lllllll 

TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCT 1817 
GTCCCTACATCTGGAGCCTAGACACTCAGTAC^GCAAGGTCCTGGCTCTGTACAACCAGC 108 9 

I llllllll llllllll Mill lllllillllllllllllll lllllllllllll 

GCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGC 18 77 



Qy 10 9 0 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 9 

I IIIIIIIMII I I I I I I I I II I I I Ml M I : I I I I I I II I I I I I M I ! ! lllllll 
Db 1878 ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCA 1937 

Qy 115 0 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 1209 

i I II M 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 II 1 1 M II : 1 1 1 II 1 1 1 1 II I M 1 1 :l 1 1 1 1 1 II 1 1 I 

Db 1938 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCT 19 97 

Qy 1210 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 1269 

lllllllllllllllllll lllllll II I II I II I I I Mill I 

Db 1998 CCTGCAAGTGCAGCTGAGGTGCCGCGCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCCCAC 2 057 

Qy 12 7 0 CCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACATCGTGCCCCAAG 1321 

III MM III I I II lllllllllllllllllll IIIIIIIMII 

Db 2 058 CCGGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCGTGCCCCAAG 2117 

Qy 1322 GCCACTTGGGATCGATTAAA 1341 

Mill MM I I II 
Db 2118 CCCACCTGGGGCCCCATTAA 2137 

RESULT 6 
AAT15720 

ID AAT15720 standard; cDNA; 2537 BP. 
XX 

AC AAT1572 0; 
XX 

DT 25-MAR-2003 (updated) 

DT 24-JUL-1997 (revised) 

DT 25-JAN-1980 (first entry) 
XX 

DE Pre-transforming growth factor beta 1 cDNA . 
XX 

KW transforming growth factor beta 1; wound healing; 

KW recombinant production; ss. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT 5 'UTR 1 . . 841 

FT /*tag= a 

FT misc_feature 37.. 113 
FT " /*tag= b 

FT /note= "GC-rich region forms stable hairpin loops; 

FT similar to structural organisation of c-myc RNA, 

FT could play role in mRNA stability or in 

FT regulation of transcription" 

FT CDS 842 . .2014 

FT /*tag= c 

FT /product^ pre-TGF_beta_l 

FT mat_peptide 1676.. 2011 

FT /*tag= d 

FT /product= mature_TGF_beta_l 

FT repeat_region 2015.. 2100 
FT /*tag= e 

FT /note= "GC-rich region; possibly responsible for the 

FT fact 3 'UTR of mRNA could not be cloned as cDNA; 



FT 
r i 




may De lmporiianu ror transcription erriciency 


r i. 


repeac unic 


zui j . . zuz j 






/ Lay— L 


FT 




^U-7*±. . Z 1UU 


r i 




/ tag- g 


r l 




/note= "TATA- like sequence; no evidence that this 


PT 
r i 




functions a promoter" 


PT 
r l 


puiy/A. biyiiai 




r 1 




/ Lay = n 


PT 

r i 


iiiibc biyndi 


ZDZy . . ZDJO 


PT 
r i 




/ Lay— 1 


PT 
r i 




/ iiuLG— Loribciibus sequence liunicaiaCciy preceaes 


PT 
r l 




poiy/\-Laii ^tjenoiSL eu di j 


Y Y 
AA 






DM 


T ft 9 ft ^ "1 - A 




Y Y 
AA 






pn 


fl Q-.7AM- 1 QQfi 
u u ± y y d . 




Y Y 
AA 






PP 


05-NOV-1993; 


93US-0147364 . 


Y Y 
AA 






PR 


13-MAR-1987; 


87US-0025423 . 


PR 


22-MAR-1985; 


85US-0715142 . 


PR 


04-AUG-1989; 


89US-0389929. 


DD 
rK 


04-MAR-1992; 


92US-0845893 . 


DD 
rK 


05-NOV-1993; 


93US-0147364 . 


XX 






PA 


(GETH ) GENENTECH INC. 


XX 






D T 
r 1 


Derynck RMA, 


Goeddel DV; 


YX 






UK 


WPI; 1996-0768 


91/08 . 


DR 


P-PSDB; AAR90827. 


XX 






PT 

IT 1 


New recombinant human transforming growth factor-beta prods. - produced 


PT 


using Chinese 


hamster ovary cells, for use in diagnostic applications 


PT 


or in therapy 




Y Y 
.A. .A. 






PQ 


Example 3; Fig 


1; 2 6pp; English. 


XX 






cc 


The cDNA encodes the pre-transf ormmg growth factor (TGF) beta 1 protei: 


cc 


The nucleotide 


sequence was obtd. by an analysis of several overlapping 


cc 


cDNAs and gene 


fragments. The DNA is useful for the recombinant 


cc 


production of 


TGF beta 1, which can be used in, e.g. wound healing. 


cc 


(Revised entry 


submitted to correct sequence analysis breakdown.) 


cc 


(Updated on 25 


-MAR-2003 to correct PF field.) 


XX 






SQ 


Sequence 2537 


BP; 473 A; 893 C; 739 G; 432 T; 0 other; 



Query Match 73.5%; Score 994; DB 17; Length 2537; 

Best Local Similarity 85.7%; Pred . No. 5.8e-194; 

Matches 1148; Conservative 0; Mismatches 145; Indels 47; Gaps 2; 

Qy 10 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 6 9 

II III I I I I II I I I I I 1 I I I i ! IIIIIIIIIIIII IIIIIIIIIIIIII I 
Db 837 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 8 96 



Qy 



70 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 9 



Db 


897 


Qy 


130 


Db 


957 


Qy 


190 


Db 


1017 


Qy 


250 


Db 


1077 


Qy 


310 


Db 


1137 


Qy 


370 


Db 


1197 


Qy 


430 


Db 


1257 


Qy 


490 


Db 


1317 


Qy 


550 


Db 


1377 


Qy 


610 


Db 


1437 


Qy 


670 


Db 


1497 


Qy 


730 


Db 


1557 


Qy 


790 


Db 


1617 


Qy 


850 


Db 


1676 


Qy 


910 



I III MINIMI MMI llllllllllllll III INN 

TGGTGCTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 95 6 
AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 9 

1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 ! 1 1 : 1 1 : lllllllllll lllllllllll mm 

AG CTGGTG AAG CGG AAG CG CATCGAGG C CAT C CG CGG CCAGAT C CTGT C CAAG CTG CGG C 1016 
TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 9 

I MMMMMMMMMMMI I II 1 1 II I II II I II I II M Mill II II I 

TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 1076 
CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 09 

I II IIIIIIII llllllllllllll IIIIIIII MM . 1 1 1 i 1 1 IIIIIM 

CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 1136 
CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 369 

I Mill I 1 1 M 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 II 1 1 . 1 1 M I IM 1 1 1 1 1 1 1 1 I MM 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 1196 



IIIIIIII M MINI I I MMM MMM 



AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 8 9 

IIIIIM IIIIIIII II Mill II IIIIIIII MIMMMIMM IIIIIM 

AG CT CCGAGAAG CGGTACCTGAA CC CGTGTTG CT CT CCCGGG CAG AG CTG CGT CTGCTGA 1316 
GG CT CAAGTTAAAAGTGG AG CAG CA CGTGG AG CTATACCAG AAATA CAG CAATGATT C CT 54 9 

IMIIIIIIIIIIIIIIMI lllllllllllll MMM III llllll MMM 

GGCTC^GTTAAAAGTGGAG(^GCACGTGGAGCTGTACCAGAAATACAG(^aUVTTCCT 1376 
GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 60 9 

MM MIMIMMMIIMMIMM Mill Mill II MMM I II MM 

GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 143 6 
ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 669 

1 1 1 1 1 II 1 1 1 M I I II 1 1 1 1 1 M II Ml III III II II Mill IIIIIM 

ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 14 96 
TCAGTGCCCACTGTTCCTGTGAC^GCAAAGATAACACACTCCACGTGGAAATTAACGGGT 72 9 

I II IIIIIIII lllllllllllll lllllllllll II Mill II IMIMI 

TTAGCGCCCACTGCTCCTGTGACAGC^GGGATAACACACTGC^GTGGACATCAACGGGT 1556 
TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 78 9 

Ml I I Mill II 1 1 1 1 1 1 1 ! 1 1 1 II II 1 1 1 1 1 MIMMMIMM IMIMI 

TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 1616 



I M II I II I II M 1 1 1 II 1 1 1 M M I M M Mill 1 1 1 1 M 1 1 1 1 1 II I II 1 1 



MINIM IMIMI; MM 

-GCCCTGGACACCAACTATTGCT 1697 



II I II I II II II I II M I M II I II 1 1 II 1 1 1 1 1 M I MIMMMIMM IIIIIM 



Db 



16 98 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACC 1757 



Qy 97 0 TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 102 9 

i 1 1 1 :i i ii i m mi ii ii iiiiiiiiiiiiiiiiiiii iiiiiiii iiiiiii 

Db 1758 TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCT 1817 

Qy 1030 GTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 108 9 

I llllllll llllllll Mill I I II I I I I I I I I I I I I I I I I lllllllllllll 
Db 1818 GCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGC 1877 

Qy 10 90 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 9 

I IIIIMIIIII II I II 1 1 1 1 1 II 1 1 1; 1 1 1 1 1 1 1 1 i Ml I ! HI 1 1 1 I lllllll 

Db 1878 ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCA 1937 

Qy 1150 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 12 09 

1 1 1 1 M Ml I M I II I II 1 1 II I II I Ml M M M 1 1 1 III 1 1 M II 1 1 1 1 1 1 1 1 1 1 I 

Db 1938 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCT 1997 

Qy 1210 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 126 9 

IIMIM lllllll II llllllllllll llllllll Mill I 

Db 1998 CCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCCCAC 2 057 
Qy 127 0 CCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACATCGTGCCCCAAG 1321 

III 1 1 1 1 III II M Illllllllllllllllll IIIIMIIIII 

Db 2058 CCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCGTGCCCCAAG 2117 

Qy 1322 CCCACTTGGGATCGATTAAA 1341 

Mill MM I I II 
Db 2118 CCCACCTGGGGCCCCATTAA 2137 

RESULT 7 
AAQ56923 

ID AAQ56923 standard; cDNA; 2537 BP. 
XX 

AC AAQ56923; 
XX 

DT 25-MAR-2003 (updated) 

DT 09-JUL-1994 (first entry) 

XX 

DE Human pre-TGF-beta- 1 . 
XX 

KW TGF-beta-1; TGF-beta-2; transforming growth factor beta-1; 

KW transforming growth factor beta-3; recombinant; wound healing; 

KW vulnerary; ss. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 
FT misc_structure 47.. 113 
FT /*tag= a 

FT /note= "possible hairpin loop region" 

FT CDS 842 . .2014 

FT /*tag= b 

FT mat_peptide 1676.. 2011 

FT /*tag= c 

FT polyA__signal 2515.. 2521 



FT 

XX 
PN 
XX 
PD 
XX 
PF 
XX 
PR 
PR 
PR 
PR 
XX 
PA 
XX 
PI 
XX 
DR 
DR 
XX 
PT 
PT 
XX 
PS 
XX 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
XX 
SQ 



/*tag= d 



US5284763-A. 



08-FEB-1994 



04 -MAR- 19 92; 92US-084 58 93 



22-MAR-1985 
13-MAR-1987 
04-AUG-1989 
04-MAR-1992 



85US-0715142. 
87US-0025423 . 
89US-0389929. 
92US-0845893 . 



(GETH ) GENENTECH INC. 

Derynk RMA, Goeddel DV; 

WPI ; 1994-056343/07 . 
P-PSDB; AAR46227. 

Nucleic acid sequences encoding transforming growth factor-beta - 
diagnostic probes, and for use in therapeutics 

Disclosure; Fig lb; 25pp; English. 

cDNA sequences were determined for human pre-TGF-beta-1 (AAQ56923) , 
pig TGF-beta-3 (AAQ56925) and human TGF-beta-3 (AAQ56926) , and the 
corresponding amino acid sequences were determined (AAR46227 -29 , 
respectively) . A genomic fragment corresponding to a human TGF- 
beta-1 exon (AAQ56924) was also isolated and its amino acid sequence 
determined (AAR4 6230) . The sequences have been used in the 
construction of vectors for the expression of recombinant TGF- 
beta . 

(Updated on 25-MAR-2003 to correct PF field.) 
Sequence 2537 BP; 473 A; 890 C; 742 G; 432 T; 0 other; 



Query Match 73 . 3%; 

Best Local Similarity 85.6%; 
Matches 1147; Conservative 



Score 992.4; DB 15; 
Pred. No. 1.2e-193; 
0; Mismatches 146; 



Length 2537; 
Indels 47; Gaps 



2; 



Qy 



Db 



1 0 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 6 9 

II III I I I I II lllllllllll lllllilllllll llllllllllllll I 
837 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 8 96 



Qy 

Db 

Qy 
Db 



70 



897 



TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 

I II II lllllllir MINIMI Mill llllllllllllll II II I M I II 

TGGTGCTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 



129 



956 



189 



13 0 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 

II 1 1 1 1 1 II I II I 1 1 1 1 1 1 1 1 1 1 II 1 1 1 IIMIilllll IMIMIIIII MM 

957 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 1016 



Qy 190 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 9 

I 1 1 'I II 1 1 1 1 1 1 1 M I! 1 1 1 1 1 1 1 Ml I M 1 1 1 1 III 1 1 1 Mill II II I 

Db 1017 TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 1076 



Qy 


250 


CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 

1 II IIIIIIII llllllllllllll IIIIIIII MM MM MINI 

CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACGGGAGCCCGAGC 


309 


Db 


1077 


1136 


Qy 


310 


CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 

1 Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 MM 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 


369 


Db 


1137 


1196 


Qy 


370 


AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 

1 1 II 1 1 II II II MIMI 1 1 IIIIII IIIIIIII 1 IIIIIIII II 1 

AAATCTATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAG 


429 


Db 


1197 


1256 


Qy 
Db 


430 
1257 


AGGTCCGGGAAGCGGTGCGGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 

Illllll IIIIIIII II Mill II IIIIIIII llllllllllllll Illllll 
AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 


489 
1316 


Qy 


490 


GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 

! 1 1 1 1 1 1 M 1 1 M 1 1 1 , 1 MINIMI 1 1 Mil MM 1 1 M 1 1 1 1 1 M i 1 IIIIII 

GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCT 


549 


Db 


1317 


1376 


Qy 


550 


GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 

MM M M 1 M 1 1 1 1 1 1 MM 1 1 1 II Mill Mill II IIIIII 1 M MM 

GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 


609 


Db 


1377 


1436 


Qy 


610 


ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 

MMMMIMMMMMMIMM III III Ml II II Mill Illllll 

ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 


669 


Db 


1437 


1496 


Qy 


670 


TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 

1 II IIIIIIII lllllllllllll lllllllllll II Mill II Illllll 

TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGT 


729 


Db 


1497 


1556 


Qy 


730 


TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 

Ml 1 1 IIIIIIII M 1 II 1 Ml 1 1 1 1 1 MM 1 M llllllllllllll MIMM 

TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 


789 


Db 


1557 


1616 


Qy 


790 


TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 

1 1 M 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 II 1 1 II 1 1 1 1 Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 { 1 1 

TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 


849 


Db 


1617 


1675 


Qy 


850 


CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 

IIIIIIII IIIIIIII MM 

GCCCTGGACACCAACTATTGCT 


909 


Db 


1676 


1697 


Qy 
Db 


910 
1698 


T CAG CT C CA CGG AG AAG AA CTG CTG CGTG CG G CAG CT CTA CATTGA CT T C CGGAAGGA CC 

II 1 1 1 M 1 M MMM 1 1 1 1 II 1 II 1 M 1 1 1 II 1 II 1 llllllllllllll Illllll 

TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACC 


969 
1757 


wy 


Zj i U 


ilj^LjL. 1LjL7/\/\Lj 1 L^ILjAALLLAALjOUL i ALvJAl GLL-AATTTLTGCCTGGGGCCLT 

1 llllllllllllll II M IIIIIIIMMIIIIIIMI IIIIIIII Illllll 

TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCT 


1029 


Db 


1758 


1817 


Qy 


1030 


GTCCCTACATCTGGAGCCTAGAC^CTC^GTACAGC^AGGTCCTGGCTCTGTACAACCAGC 

1 IIIIIIII IMIMM Mill llllllllllllllllllll IMMIMMMI 

GCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGC 


1089 


Db 


1818 


1877 


Qy 


1090 


ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 


1149 



Db 18 78 ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCA 1937 

Qy 115 0 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 12 09 

lll.lhll III IhlllllllllMI.IIMIIIM II llhllllllMI. I 

Db 1938 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCT 1997 

Qy 1210 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 12 69 

Illllllllllllllllll llllllllllll lllllll I Mill I 

Db 1998 CCTGCAAGTGCAGCTGAGGTCCCGCGCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCCCAC 2 057 
Qy 12 7 0 CCCACCCCCGGCCGCCT CACGGGGGGTGTATTTAAGGACATCGTGCCCCAAG 1321 

in 1 1 1 1 iii ii ii r I I! I I I ' II II I II I I lllllllllll 

Db 2 058 CCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCGTGCCCCAAG 2117 

Qy 1322 CCCACTTGGGATCGATTAAA 1341 

Mill MM I I II 

Db 2118 CCCACCTGGGGCCCCATTAA 2137 



RESULT 8 
AAV52933 

ID AAV52933 standard; cDNA; 2537 BP. 
XX 

AC AAV52 933; 
XX 

DT 25-MAR-2003 (updated) 

DT 21-DEC-1998 (first entry) 

XX 



DE Human pre- transforming growth factor-beta 1 cDNA . 
XX 

KW Transforming growth factor-beta 1; TGF-beta 1; human; ss . 
XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT CDS 842 . .2014 

FT /*tag= a 

FT mat_peptide 1676.. 2011 

FT /*tag= b 

FT stem_loop 37.. 113 

FT /*tag= b 

FT /note= "putative stable hairpin loop" 

FT misc_feature 2015.. 2100 

FT /*tag= c 

FT /note= "GC-rich sequence" 

FT polyA_signal 2514.. 2520 

FT ~~ /*tag= d 

XX 

PN US5801231-A. 
XX 

PD 01-SEP-1998. 
XX 

PF 30-MAY-1995; 95US- 04 544 68 . 
XX 

PR 13-MAR-1987; 87US-0025423 . 
PR 22-MAR-1985; 85US-0715142 . 



PR 04-AUG-1989; 89US-0389929 . 

PR 04-MAR-1992; 92US- 084 58 93 . 

PR 05-NOV-1993; 93US- 014 7364 . 

PR 30-MAY-1995; 95US - 04544 68 . 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Derynck RMA, Goeddel DV; 
XX 

DR WPI; 1998-494840/42. 

DR P-PSDB; AAW78785. 
XX 

PT DNA encoding transforming growth factor-beta precursor sequence - 

PT useful for analysis to perform manipulations to increase yield of 

PT recombinant production of the protein 
XX 

PS Example 3; Fig IB 1-3; 26pp; English. 
XX 

CC This nucleotide sequence codes for the human transforming growth 

CC factor-beta 1 precursor (preTGF-beta 1, see AAW78785) . It is a 

CC composite of overlapping cDNA clones isolated from different cDNA 

CC libraries (placenta, A172 glioblastoma, HT1080 f ibroblastoma) using 

CC TGF-beta exon (see AAV52936) restriction fragments as probes. 

CC The 3' region of the sequence was determined using cloned genomic 

CC DNA. The invention relates to the recombinant production of 

CC TGF-beta. Biologically active TGF-beta is defined as being capable 

CC of inducing EGF-potentiated anchorage independent growth of target 

CC cell lines and/or growth inhibition of neoplastic cell lines. 

CC Nucleic acids encoding TGF-beta have been isolated and cloned into 

CC vectors which are replicated in bacteria and expressed in 

CC eukaryotic cells. TGF-beta recovered from transformed cells is 

CC used in known therapeutic applications. TGF-beta nucleic acids are 

CC also useful in diagnosis and identification of TGF-beta clones. 

CC (Updated on 25-MAR-2003 to correct PF field.) 

XX 

SQ Sequence 2537 BP; 475 A; 895 C; 736 G; 431 T; 0 Other; 

Query Match 73.1%; Score 989.2; DB 19; Length 2537; 

Best Local Similarity 85.4%; Pred . No. 5.6e-193; 

Matches 1145; Conservative 0; Mismatches 148; Indels 47; Gaps 2; 

Qy 10 CCGAGATGG CG C CTTCGGGG CTG CGG CT CTTG CCG CTG CTG CTG CCG CTG CTGTGG CTG C 69 

I I Ml I I I I II IIIIMIIIII lllllllllllll IIIIIIIIIMIII I 
Db 837 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 8 96 

Qy 70 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 9 

i mi nil i: i ii'ii'i i him immimiimm minimi 

Db 8 97 TGGTGCTGACGCCTGGCCCGCCGGCCCCGGGACTATCCACCTGCAAGACTATCGACATGG 956 

Qy 13 0 AG CTGGTGAAGCGGAAGCG CAT CGAGGCCATTCG CGG CCAGATTCTGTCCAAGCTT CGG C 18 9 

Ml 1 1 II I M I M 1 1 II 1 1 1 1 1 1 III I II; 1 1 1 1 1 1 1 1 1 1 1 IIIIMIIIII MM 

Db 957 AG CAGGTG AAG CGG AAG CG CATCG AGG C CAT C CG CGG CCAGAT C CTGT C CAAG CTG CGG C 1016 

Qy 190 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 9 

I 1 1 II M 1 1 1 M II I M II II II I Mil 1 1 1 1 1 1 1 M 1 1 1 1 1 1 Mill II II I 

Db 1017 TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 1076 



Qy 


250 


Db 


1077 


Qy 


310 


Db 


1137 


Qy 


370 


Db 


1197 


Qy 


430 


Db 


1257 


Qy 


490 


Db 


1317 


Qy 


550 


Db 


1377 


Qy 


610 


Db 


1437 


Qy 


670 


Db 


1497 


Qy 


730 


Db 


1557 


Qy 


790 


Db 


1617 


Qy 


850 


Db 


1676 


Qy 


910 


Db 


1698 


Qy 


970 


Db 


1758 


Qy 


1030 


Db 


1818 



CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 0 9 

I II IIIIIIII llllllllllilll IIIIIIII 1 1 1 1 MINIM MINI 

CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 1136 
CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAA.ee 369 

I Mill 1 1 1 1 1 1 1 1 1 M I 1 1 1 11 1 i 1 1 1 M 1 1 1 1 1 : 1 1 1 1 1 1 1 [ I ! I MM 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 1196 
AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 4 2 9 

1 1 1 ] 1 1 1 E 1 1 II MMM I I MINI IIIIIIII I Ml MM II I 

AAATCTATGACAAGTTCAAGCAGAGTACACACAGCATAT^ 12 5 6 

AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 8 9 

IIIIIII IIIIIIII II Mill II IIIIIIII II I MM IIIIIII 

AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 1316 
GGCTC^GTTAAAAGTGGAGCAGC^CGTGGAGCTATACC^GAAATACAGC^ATGATTCCT 54 9 

1 1 M II M I! I II III II M M M II I M MM IIIIIIIIIIIIIIIII WWW 

GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTAC(^GAAATA(^GC^OWVTTCCT 1376 
GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 60 9 

MM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 i 1 1 Mill MM II MMM I II MM 

GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 1436 
ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 669 

MMMMMMMMMMMI Ml III III I II Mill IIIIIII 

ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGAGAAATTGAGGGCTTTCGCC 14 96 
TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 72 9 

I II IIIIIIII IMIMM I I M 1 1 1 1 1 M I II MM II IIIIIII 

TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACAO^CTGCAAGTGGACATCAACGGGT 15 56 
TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 78 9 

Ml I I IMIMM MMMMIMMMIMM llllllllllilll MMM 

TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 1616 
TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 9 

I M Ml I M M Ml Ml 1 1 III II I II III I MM 1 1 1 1 1 1 1 1 1 1 1 1 II I II I 

TTCTCATGGCCy\CCCCGCTGGAGAGGGCCCAGCATCTGCAPAGCTCCCGGCACCGCCGA- 1675 
CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 90 9 

IMIMM IIIIIIII Ml 

GCCCTGGACACCAACTATTGCT 1697 

TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 96 9 

II Ml I M MM II Ml II M Ml Ml MIMIIII llllllllllilll MMM 

TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACC 1757 
TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 102 9 

I llllllllllilll II II llllllllllllllllllll llllllll lllllll 

TCGG CTGGAAGTGG ATC CA CGAG C C CAAGGG CTACCATG C CAACTT CTG C CT CGGG CC CT 1817 
GTCCCTACATCTGGAGCCTAGACACTCAGTACAGC^GGTCCTGGCTCTGTAa^CCAGC 108 9 

I MMM MMM MM I III 1 1 III 1 1 1 1 1 1 1 1 1 1 1 IIIIIIIIIIIII 

GCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGC 1877 



Qy 



1090 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 9 



I lllllllllll Ml 1 1 1 il 1 1 1 1 M II 1 1 1 1 1 1 1 II 1 1 II 1 1 1 II 1 1 lllllll 

Db 18 78 ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCA 1937 

Qy 1150 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 1209 

ML Mill. Ill llllllll ILI III II II II li Mill III III Mill II III I 

Db 1938 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCT 1997 

Qy 1210 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 1269 

1 1 1 1 f i r r 1 1 j j 1 1 1 1 1 1 f iiiiiMMiii ii i ii i i i inn i 

Db 19 98 CCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCCCAC 2 057 
Qy 1270 CCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACATCGTGCCCCAAG 1321 

Ml 1 1 1 1 III II II 1 1 1 1 ! 1 1 1 1 II 1 1 1 1 1 1 1 1 lllllllllll 

Db 2 058 CCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCGTGCCCCAAG 2117 

Qy 1322 CC CA CTTGGGAT CGATTAAA 1341 

Mill MM I I II 

Db 2118 CCCACCTGGGGCCCCATTAA 2137 



AAQ03268; 

25-MAR-2003 (updated) 
12-AUG-1990 (first entry) 

Simian transforming growth factor-beta cDNA. 
Transforming growth factor-beta; psoriasis; TGF-beta; ss. 
Monkey . 



RESULT 9 
AAQ03268 

ID AAQ03268 standard; DNA; 1561 BP. 
XX 
AC 
XX 
DT 
DT 
XX 
DE 
XX 
KW 
XX 
OS 
XX 
FH 
FT 
FT 
FT 
FT 
FT 
XX 
PN 
XX 
PD 
XX 
PF 
XX 
PR 
XX 
PA 
XX 
PI 
XX 
DR 
DR 
XX 



Key 

sig_j?eptide 
mat_peptide 



EP353772-A. 
07-FEB-1990 . 

04- AUG-1989; 

05- AUG-1988; 



Location/Qualifiers 
283 . .324 
/*tag= a 
1096 . . 1431 
/*tag= b 

/product=human transforming growth factor-beta 



89EP-0114458 . 
88US-0229133 . 
(ONCO ) ONCOGEN LP. 

Twardzik DR, Purchio AF, Ranchalis JE, Stevens V; 

WPI; 1990-038499/06. 
P-PSDB; AAR03743. 



PT Inhibition of proliferation of epidermal cells - 

PT used to treat psoriasis by contacting cells with compositions 

PT containing transforming growth factor-beta. 

XX 

PS Disclosure; fig 1; 2 0pp; English. 
XX 

CC TGF-beta may be used in the treatment of hyperplasia 

CC associated with acanthosis -categorised skin diseases, and 

CC in alleviating psoriatic symptoms associated with cytokine- 

CC induced phenomena. See also AAQ03269 and AAR03750. 

CC (Updated on 25-MAR-2003 to correct PA field.) 

XX 

SQ Sequence 1561 BP; 301 A; 547 C; 446 G; 267 T; 0 other; 

Query Match 72.9%; Score 986.2; DB 11; Length 1561; 

Best Local Similarity 85.8%; Pred . No. 2.1e-192; 

Matches 1150; Conservative 0; Mismatches 143; Indels 48; Gaps 3; 

'GAGATGG CG C CTTCGGGG CTG CGG CT CTTG CCG CTG CTG CTG CCG CTG CTGTGG CTG C 6 9 
III I I I I II lllllllllll lllllllilllll llllllllllllll I 



I llllllllllll lllllllilllll Mill llllllllllllll IIIIIIIMI 



AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 9 

1 1 1 1 1 1 1 1 1 I I MM MMMMMI lllllllllll I 1 1 1 

AGCTGGTGAAGCGGAAGCGCATCGAGACCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 43 6 
TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 9 

I lllllllllllllllllllllll IIIIIIIMIIIIIMIIII Mill M II I 

TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 4 96 
CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 09 

I II llllllll llllllllllllll llllllll MM II MM l:M I 

CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCGGAGCCGGAGCCCGAAC 556 
CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 369 

I Mill 1 1 1 1 IIIIIIIMIIIIIMIIII 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 I MM 

CGGAGGCCGACTACTACGCCAAGGAGGTC^CCCGCGTGCTAATGGTGGAAACCCACAACG 616 
AAATCTATGATAAATTCAAGGGC^CCCCCC^ 429 

IIIIIIIMI II MUM I I I MIMI llllllll I llllllll II I 

AAATCTATGACAAGTTCAAGC^GAGC^CACACAGCA 676 

AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 48 9 

I M II II Mill II II llllllll llllllll llllllllllllll Illllll 
AGCTCCGAGAAGCAGTACCTGAACCTGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 73 6 

GGCTC7W\GTTAAAAGTGGAGCAGC^CGTGGAGCTATACC^GAAATACAGCAATGATTCCT 54 9 

I M I I I I I MIIIIM I MIMI 

GGCTCAAGTTAAAAGTGGAGCAGC^TGTGGAGCTGTACC^ 796 
GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 609 

MM I Mill MM 1 1 1 1 [ 1 1 1 1 MM 1 1 1 1 

GGCGATACCTCAGCAACCGGCTGCTGGCGCCCAGCAACTCGCCGGAGTGGTTGTCTTTTG 8 56 



Qy 


10 


Db 


257 


Qy 


70 


Db 


317 


Qy 


130 


Db 


377 


Qy 


i nn 

i y u 


Db 


437 


Qy 


250 


Db 


497 


Qy 


310 


Db 


557 


Qy 


370 


Db 


617 


Qy 


430 


Db 


677 


Qy 


490 


Db 


737 


Qy 


550 


Db 


797 



Qy 610 ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 669 

I III III MM III I II I llllllll III MM III II II Mill II III II 

Db 8 57 ATGTCA CCGGAG TTGTG CGG CAGTGGTTGAG C CG CGGAGGGGAAATTGAGGG CTTT CG CC 916 

Qy 670 TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 72 9 

I II Ml Ml II MMMMMMMMMIMIMM M Mill II II Mill 

Db 917 TTAGCGCCCACTGCTCCTGTGACAGCAAAGATAACACACTGCAAGTGGACATCAACGGGT 976 

Qy 73 0 TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 78 9 

III II llllllll llllllllllllll Mill MMMMIIMM IMMM 

Db 977 TCACTACCGGCCGCCGAGGTGACCTGGCCACAATTCATGGCATGAACCGGCCTTTCCTGC 103 6 

Qy 7 90 TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 9 

I MMMMIIMM llllllllllllll II Mill 1 1 1 1 1 II II 1 1 1 II 1 1 M 

Db 1037 TTCTCATGGCCACCCCACTGGAGAGGGCCCAACATCTGCAAAGCTCCCGGCACCGCCGA- 1095 

Qy 850 CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 909 

MM II II MM MM Mill 

Db 1096 GCCCTGGACACCAACTACTGCT 1117 

Qy 910 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 969 

MMMMMMMMMIMIMM III II II I III II MINIM IMMM 

Db 1118 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTATATTGACTTCCGCAAGGACC 1177 

Qy 970 TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 102 9 

I MMMMIIMM II II IN III II 1 1 II 1 1 1 II II I III MM Ml Ml Ml 

Db 1178 TCGGCTGGAAGTGGATCCAGGAGCCCAAGGGCTACCATGCCAACTTCTGCCTGGGGCCCT 123 7 

Qy 103 0 GTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 108 9 

MINIMI! llllllll Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! lllllllllllll 

Db 1238 GTCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGC 12 97 

Qy 1090 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 9 

I I I I I I I I I II I I II I I I II II I II I II II I I M I II I II II I I II II I I I I I II II I 
Db 12 98 ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 1357 

Qy 1150 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 12 09 

II IINN II II I II N NINNI INNINIIIINI II I 

Db 13 58 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCT 1417 

Qy 1210 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 1269 

IMMM MM II MINIM II IN llllllll Mill II 

Db 1418 CCTGCAAATGCAGCTGAGGCCCCGCCCCGCCCCGCCCCACCCCGGCAGGCCCGGCCCCGC 1477 
Qy 1270 CCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACA - TCGTGCCCCAA 132 0 

llllllll III II I II II II II 1 1 INN III I Illlllllll 

Db 14 78 CCCACCCCACCCCCGCTGTCTTGCCCTTGGGGGCTGTATTTAAGGACACCCGTGCCCCAA 1537 

Qy 1321 GCCCACTTGGGATCGATTAAA 1341 

MINI MM I I II 

Db 1538 GCCCACCTGGGGCCCCATTAA 1558 



RESULT 10 
AAI58342 

ID AAI58342 standard; cDNA; 2742 BP. 



XX 

AC AAI58342; 
XX 

DT 22-OCT-2001 (first entry) 
XX 

DE Human polynucleotide SEQ ID NO 545. 
XX 

KW Human; nootropic; immunosuppressant; cytostatic; gene therapy; cancer; 

KW peripheral nervous system; neuropathy; central nervous system; CNS; 

KW Alzheimer's; Parkinson's disease; Huntington's disease; haemostatic; 

KW amyotrophic lateral sclerosis; Shy-Drager Syndrome; chemotactic; 

KW chemokinetic; thrombolytic; drug screening; arthritis; inflammation; 

KW leukaemia; ss. 

XX 

OS Homo sapiens. 
XX 

PN WO200153312-A1 . 
XX 

PD 26-JUL-2001. 
XX 

PF 26-DEC-2000; 2 000WO-US34263 . 
XX 

PR 21-JAN-2000; 2000US- 0488725 . 

PR 25-APR-2000; 2000US- 05523 17 . 

PR 09-JUL-2000; 2000US- 0598 042 . 

PR 19-JUL-2000; 2000US-0620312 . 

PR 03-AUG-2000; 2000US-0653450 . 

PR 14-SEP-2000; 2 000US - 0662 1 9 1 . 

PR 19-OCT-2000; 2000US-0693036 . 

PR 29-NOV-2000; 2 000US - 0727344 . 
XX 

PA (HYSE-) HYSEQ INC . 
XX 

PI Tang YT, Liu C, Asundi V, Chen R, Ma Y, Qian XB, Ren F, Wang D; 

PI Wang J, Wang Z, Wehrman T, Xu C ( Xue AJ, Yang Y, Zhang J; 

PI Zhao QA, Zhou P, Goodrich R, Drmanac RT; 
XX 

DR WPI; 2001-442253/47. 

DR P-PSDB; AAM39186. 
XX 

PT Novel nucleic acids and polypeptides, useful for treating disorders 

PT such as central nervous system injuries - 

XX 

PS Claim 1; SEQ ID NO 545; 10078pp; English. 
XX 

CC The invention relates to human nucleic acids (AAI57798 -AAI61369) and 

CC the encoded polypeptides (AAM3 8642 -AAM42213 ) with nootropic, 

CC immunosuppressant and cytostatic activity. The polynucleotides are useful 

CC in gene therapy. A composition containing a polypeptide or polynucleotide 

CC of the invention may be used to treat diseases of the peripheral nervous 

CC system, such as peripheral nervous injuries, peripheral neuropathy and 

CC localised neuropathies and central nervous system diseases, such as 

CC Alzheimer's, Parkinson's disease, Huntington's disease, amyotrophic 

CC lateral sclerosis, and Shy-Drager Syndrome. Other uses include the 

CC utilisation of the activities such as: Immune system suppression, 

CC Activin/inhibin activity, chemotact ic/chemokinet ic activity, haemostatic 

CC and thrombolytic activity, cancer diagnosis and therapy, drug screening, 



CC assays for receptor activity, arthritis and inflammation, leukaemias and 

CC C.N.S disorders. 

CC Note: The sequence data for this patent did not form part of the printed 

CC specification. 

XX 

SQ Sequence 2742 BP; 526 A; 938 C; 799 G; 479 T; 0 other; 

Query Match 72.8%; Score 984.6; DB 22; Length 2742; 

Best Local Similarity 85.7%; Pred. No. 4.9e-192; 

Matches 1149; Conservative 0; Mismatches 144; Indels 48; Gaps 3; 

CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 6 9 

II III 1 1 1 1 II MINIUM Ml IMIMI IIIIIIIIIIIMI I 

CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 8 96 
TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCC^CCTGCAAGACCATCGACATGG 12 9 

I M M M M! N II 1 1 M M I M I M MM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III IMIM 

TGGTGCTGACGCCTGGCCGGCCGGCCG CGGG A CTAT C CA CCTG CAAG A CTAT CGA CATGG 956 
AGCTGGTGAAGCGGAAGCGCATCGAGGCC^TTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 9 

II 1 1 II I M 1 1 1 1 1 1 1 1 1 1 1 1 1 Ml 1 1 1 I IIIIIIIMM lllllllllll 1 1 1 1 

AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 1016 
TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 9 

I 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 i 1 1 1 I 1 1 1 ! 1 1 1 1 1 ! I Mill II II I 

TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 1076 

CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 09 

I II MUM IIIIIIIIIIIMI I I I I 1 I I I I I I I I I I i I ! I I I ] I I I I I I 
CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 1136 

CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 3 69 

I Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 1196 
AAATCTATGATAAATTC^GGGCACCCCCCACAGCTTATATATGCTGTTC^ACACGTCGG 42 9 

M 1 1 1 1 1 1 1 1 II MINI I I IMIM IMIMI I IMIMI II I 

AAATCTATGACAAGTTCAAGCAGAGTACACA 12 56 

AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 8 9 

IMIMI IMIMI II IMM II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 IMIMI 

AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 1316 
GGCT(^GTTAAAAGTGGAGCAGCACGTGGAGCTATAC(^GAAATAC^GQ\ATGATTCCT 54 9 

! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IMIM 

GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTC 1376 
GG CG CTAC CTCAG CAAC CGG CTG CTGGC C C C CAGTGA CT CA C CGGAGTGG CTGT CCTTTG 609 

MM ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill Mill II IMIM I II MM 

GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 143 6 
ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 66 9 

I I M M I M I M M I M M M M M I Ml III Ml M II Mill IMIMI 

ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 14 96 
TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 72 9 

I II MINIM 1 1 1 1 1 1 1 1 1 1 1 1 1 lllllllllll II Mill II IMIMI 



Qy 


10 


Db 


837 


Qy 


70 


Db 


897 


Qy 


130 


Db 


957 


Qy 


190 


Db 


1017 


Qy 


250 


Db 


1077 


Qy 


310 


Db 


1137 


Qy 


370 


Db 


1197 


Qy 


430 


Db 


1257 


Qy 


490 


Db 


1317 


Qy 


550 


Db 


1377 


Qy 


610 


Db 


1437 


Qy 


670 



Db 



1497 



TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGT 



1556 



Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 



730 TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 78 9 

III I I llllllll 1 1 II ! 1 1 1 1 1 ! 1 1 1 1 ! 1 1 1 1 llllll IIMIII MM 

15 57 TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 1616 



790 



849 



TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 

I I I M 1 1 ! 1 1 1 1 M II I II 1 1 1 1 M 1 1 1 M 1 1 Mill IM 1 1 M 1 1 1 1 1 1 1 1 1 

1617 TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 1675 



850 



909 



CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 

llllllll llllllll I I I I 
!676 GCCCTGGACACCAACTATTGCT 1697 



910 



969 



TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 

1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1: 1 1 M ! 1 1 i 1 1 1 1 1 1 1 j : I I M 1 1 1 1 1 1 1 1 1 1 1 in : 

16 98 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACC 1757 



970 



1758 



TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 102 9 

I : 1 1 1 1 1 1 1 1 1 1 M I II II IIIIIMIIIII IIMIII llllllll IIMIII 

TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCT 1817 



103 0 GTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 108 9 

I llllllll llllllll Mill M 1 1 1 1 1 1 1 M 1 1 II 1 1 1 M MIIMIIIIIII 

1818 GCCCCTACATTTGGAGCCTGGA(^CG(^GTA(^GC^GGTCCTGGCCCTGTACAACCAGC 1877 
1090 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 9 

I 1 . 1 1 1 1 1 1 1 1 I II II I II I Ml II 1 1 1 1 1 M 1 1 II 1 1 1 M I M 1 1 1 IIMIII 

1878 ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCA 1937 
115 0 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 12 0 9 

MM IMIMI III III M Ml MMMIMIMM MUM M'M I 

1938 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCT 1997 
1210 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 12 69 

M I i 1 1 1 1 1 M 1 1 II I II I IIIIIMIIIII llllllll Mill I 

1998 CCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCCCAC 2 057 
1270 CCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACA - TCGTGCCCCAA 132 0 

III MM III M II 1 1 1 M 1 1 1 1 Ml 1 1 1 1 1 1 : Ml MUM 

2 058 CCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCCGTGCCCCAA 2117 

1321 GCCCACTTGGGATCGATTAAA 1341 

llllll MM I I II 
2118 GCCCACCTGGGGCCCCATTAA 213 8 



RESULT 11 
AAQ20289 

ID AAQ20289 Standard; cDNA; 1559 BP. 
XX 

AC AAQ20289; 
XX 

DT 25-MAR-2003 (updated) 

DT 16-APR-1992 (first entry) 

XX 

DE Sequence encoding simian transforming growth factor (TGF) beta-1 



XX 






K.W 


Hypertens ion 


therapy; hypotensive agent; blood pressure modulator; 




ss . 




XX 






Ub 


Monkey . 




vv 

AA 






r xi 


Key 


Locau ion/ yuai ir lers 


FT 


CDS 


262 . .282 


FT 




/*tag= a 


FT 


sig__pept ide 


283 . . 324 


FT 




/*tag= b 


FT 


CDS 


325 . .1098 


FT 




/*tag= c 


FT 


mat peptide 


1099 . . 1436 


FT 




/*tag= d 


XX 






FN 


W09119513-A. 




XX 






PD 


26-DEC-1991 . 




XX 






PF 


20-JUN-1991; 


91WO-US04449. 


XX 






PR 


20-JUN-1990; 


90US-0541221. 


XX 






FA 


(BRIM ) BRISTOL-MYERS SQUIBB CO. 


Y V 
AA 






PI 


Oleson FB, 


Comer e ski CR; 


vv 
AA 






DR 


WPI; 1992-024199/03. 


UK 


P-PSDB; AAR20124. 


vv 
AA 






F 1 


Use of transforming growth factor (TGF) -beta and their 


PT 


antagonists 


- for modulating blood pressure, for treating 


PT 


hypertension and hypotension 


XX 






D C 

Fb 


Disclosure; 


Fig l; 42pp; English. 


XX 






CC 


A new method 


for treating hypertension comprises administering a 


LL 


transforming growth factor (TGF) -beta to an individual at a dose 


CC 


effective for lowering blood pressure; the TGF-beta may be e.g. 


CC 


mature TGF-beta, TGF-beta2, a mature TGF-betal/beta2 hybrid, TGF- 


CC 


betal precursor, a latent TGF-beta2 precursor, hybrid TGF- beta 1 /TGF - 


CC 


beta2 precursor, a latent TGF-betal complex or a latent TGF-beta2 


CC 


complex . 




CC 


(Updated on 


25-MAR-2003 to correct PA field.) 


XX 






SQ 


Sequence 1559 BP; 300 A; 546 C; 446 G; 267 T; 0 other; 



Query Match 72.7%; Score 983.8; DB 13; Length 1559; 

Best Local Similarity 85.8%; Pred . No. 6.6e-192; 

Matches 1146; Conservative 0; Mismatches 142; Indels 47; Gaps 3; 

Qy 15 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 74 

I I I I I I I II llllllillll lllllllllllll llllllllllllll II III 
Db 2 61 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 320 



Qy 



75 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 134 



Illllllll lllllllllllll Mill llllllllllllll llllll 

321 CTGACGCCTAGCCGGCCGGCCGCAGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 



Db 


321 


CTGACGCCTAGCCGGCCGGCCGCAGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 


380 


Qy 


135 


GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 

llllllllllllllllllll INI IIMMIMM lllllllllll Mill III 

GTGAAGCGGAAGCGCATCGAGACCATCCGCGGCCAGATCCTGTCCAAGCTGGGGCTCGCC 


194 


Db 


381 


440 


Qy 
Db 


195 
441 


AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 

llllllllllllllllllll llllllllllllllllllll Mill II II II II 
AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 


254 
500 


Qy 


255 


TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 

llllllll IMIMIMMIM llllllll MM II IIMMIMM II III 

TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCGGAGCCGGAGCCCGAACCGGAG 


314 


Db 


501 


560 


Qy 


315 


GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 

II MMIMIIIMIMMMMIIMMMIMIMMIIMI 1 MM Mill 

GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 


374 


Db 


561 


620 


Qy 


375 


TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 

Mill II llllll 1 1 1 llllll llllllll 1 llllllll M llllll 
TATGAC^GTTCAAGC^GAGCACACACAGC^TATATATGTTCTTC^C^CATC^GAGCTC 


434 


Db 


621 


680 


Qy 


435 


CGGG AAG CGGTG C CGGAACCTGT ATTG CTCTCT CGGG CAGAG CTG CG C CTG CTG AGG CT C 

M Mill M II llllllll IIIIIMI llllllllllllll MMMIIIMI 

CGAGAAGCAGTACCTGAACCTGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 


494 


Db 


681 


740 


Qy 


495 


AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 


554 


Db 


741 


lllllllllll llllllll llllllll IMMIMMIMIMI llllllllll 

AAGTTAAAAGTCGAGCAGCATGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 


800 


Qy 


555 


TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 

Illllllllllllllllllllll Mill MM Illllllll MM Illllllll 

TACCTCAGCAACCGGCTGCTGGCGCCCAGCAACTCGCCGGAGTGGTTGTCTTTTGATGTC 


614 


Db 


801 


860 


Qy 


615 


ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 


674 


Db 


861 


Ml Ml 1 Ml Ml Ml Ml 1. Ill MM III M M Mill MIMIII II 

ACCGGAGTTGTGCGGCAGTGGTTGAGCCGCGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 


920 


Qy 


675 


GCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 

MIMIII 1 1 II II 1 II II M II 1 1 1 1 II 1 II 1 1 M 1 1 1 1 1 II M 1 II II 1 M 1 

GCCCACTGCTCCTGTGACAGCAAAGATAACACACTGCAAGTGGACATCAACGGGTTCACT 


734 


Db 


921 


980 


Qy 


735 


TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 

1 MIMIII llllllllllllll Mill 1 1 1 1 1 1 1 1 1 1 f 1 1 1 MIMIII III 

ACCGGCCGCCGAGGTGACCTGGCCACAATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 


794 


Db 


981 


1040 


Qy 


795 


ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGGTCCCGGCACCGCCGAGCCCTG 

1 1 1 1 1 II 1 M 1 II II 1 1 II 1 1 II II 1 II Mill MMIIIMIIIMIIII 

ATGGCCACCCCGCTGGAGAGGGCCCAACATCTGCAAAGCTCCCGGCACCGCCGA 


854 


Db 


1041 


1094 


Qy 


855 


GATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGC 

MIMIII MMMIMMMIMM 


914 


Db 


1095 


1121 


Qy 


915 


TCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGG 


974 



IMIIIMIM IMIMIIIIIIIMIIIMI II lllllllllll IIIIIMI III 



Db 



1122 TCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTATATTGACTTCCGCAAGGACCTCGGC 1181 



Qy 975 TGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCC 1034 

Mlllllllll 1 1 II llllllllllllllllllll lllllllllllllllllllll 

Db 1182 TGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTGGGGCCCTGTCCC 1241 

Qy 103 5 TACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAAC 1094 

Mill Illlllll Mill MMMMMMIMMMI 1 1 1 1 1 1 1 II 1 1 1 1 1 III 

Db 124 2 TACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAAC 13 01 

Qy 1095 CCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTG 1154 

MMMM 1 1 : MINI 1 1 1 1 1 M 1 1 M III 1 1 1 1 1 1 1 > 1 1 1 1 M 1 1 1 M M 1 1 1 I 

Db 13 02 CCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTG 1361 

Qy 115 5 TACTACGTGGGCCGCAAGGCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGC 1214 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MMM 

Db 13 62 TACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGC 1421 

Qy 1215 AAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGCCCCAC 12 74 

II 1 1 1 1 1 ; 1 1 , 1 1 1 1 1 1 1 M 1 1 1 1 1 1 II I II I I I I MM III I 

Db 1422 AAATGCAGCTGAGGCCCCGCCCCGCCCCGCCCCACCCCGGCAGGCCGGCCCCGCCCCACC 14 81 
Qy 127 5 CCCCGCCCGC CTCACCGGGGCTGTATTTAAGGACA-TCGTGCCCCAAGCCCAC 1326 

II I MIM I I Illllllllllllllllll 1 1 1 1 1 1 1 1 1 E f [ I II I 

Db 1482 CCAGCCCGGCTGTCTTGCCCTTGGGGGCTGTATTTAAGGACACCCGTGCCCCAAGCCCAC 1541 

Qy 132 7 TTGGGATCGATTAAA 1341 

MM I I II 
Db 154 2 CTGGGGCCCCATTAA 1556 

RESULT 12 
AAQ13392 

ID AAQ13392 standard; DNA; 1821 BP. 
XX 

AC AAQ13392; 
XX 

DT 20-NOV-1991 (first entry) 
XX 

DE Human pro-TGF-beta 1 gene. 
XX 

KW Osteogenetic; tumoricidal ; ss . 
XX 

OS Homo sapiens . 
XX 

EH Key Location/Qualifiers 

FT CDS 512.. 1684 

FT /*tag= a 

FT sig_peptide 5 12.. 5 98 

FT /*tag= b 

FT misc_RNA 599.. 1684 

FT /*tag= c 

FT /note= M pro-TGF-beta 1" 

FT mat_peptide 1346.. 1684 

FT /*tag= e 

FT /note= "TGF-beta 1" 

XX 



PN JP03180192-A. 
XX 

PD 06-AUG-1991. 
XX 

PF 07-DEC-1989; 8 9 JP - 03 1 824 3 . 
XX 

PR 07-DEC-1989; 8 9 JP - 03 18243 . 
XX 

PA (KIRI ) KIRIN BREWERY KK. 
XX 

DR WPI; 1991-271579/37. 
DR P-PSDB; AAR13813. 
XX 

PT Human pro-TGF-beta 1 prodn . , for osteo-genet ic activity - by 
PT preparing DNA chain contg. base sequence coding for human 
PT pre :pro-TGF-beta 1, forming expression vector etc. 
XX 

PS Claim 1; Fig 1; 16pp; Japanese. 
XX 

CC The DNA sequence encodes human prepro-TGF-beta 1 which can be 
CC produced by recombinant methods, it has osteogenetic and 
CC tumoricidal activity. 
XX 

SQ Sequence 1821 BP; 326 A; 679 C; 508 G; 308 T; 0 other; 

Query Match 72.7%; Score 983.4; DB 12; Length 1821; 

Best Local Similarity 85.8%; Pred. No. 8.2e-192; 

Matches 1135; Conservative 0; Mismatches 141; Indels 47; Gaps 2; 

Qy 10 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 69 

II III I I I I II IMIIIIIIII IIIMIIIIIIII llllllllllllll I 

Db 5 07 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 566 

Qy 70 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 9 

I I Ml I I I I I I I I II I I lllllllll Mill lllllllllll.; llllllllll 
Db 567 TGGTGCTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 62 6 

Qy 13 0 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 9 

III I II II I 1 1 Ml 1 1 1 M I! I MM II I I i 1 1 i I i 1 1 i MM MM 

Db 627 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 686 

Qy 190 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 9 

I M I M 1 1 II M I II 1 1 1 1 1 1 III II II II I II I II II II 1 1 1 1 Mill II M I 

Db 687 TGGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 74 6 

Qy 2 50 CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 09 

i ii iiiiiiii 1 1 1 1 1 f 1 1 1 1 r 1 1 1 iiiiiiii mm ill 

Db 74 7 CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 8 06 

Qy 310 CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 3 69 

I Mill IM IMMM M MMI IMIIMM MUM I I I MM 

Db 8 07 CTGAGGCCGACTACTACGCCAAGGAGGTC^CCCGCGTGCTAATGGTGGAAACCC^CAACG 8 66 

Qy 370 AAATCTATGATAAATTCAAGGGC^CCCCCG^CAGCTTATATATGCTGTTCAACA CGTCGG 42 9 

llllllllll II IIIIII I I IIIIII llllllll I llllllll II I 

Db 8 67 AAATCTATGACAAGTTGAAGC^GAGTACACACAGC 926 



Qy 43 0 AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 48 9 

lllllll llllllll II Mill II llllllll llllllllllllll lllllll 

Db 92 7 AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 986 

Qy 4 90 GGCTC^GTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 9 

I 1 1 1 1 1 1 1 I 1 1 1 : II II I II I 1 1 1 1 1 1 : 1 1 1 I I I, II 1 1 1 1 1 1 1 II 1 1 1 llllll 

Db 987 GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCT 104 6 

Qy 550 GGGGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 609 

MM 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 Mill Mill M 1 1 II II I II MM 

Db 1047 GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 1106 

Qy 610 ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 669 

1 1 1 1 1 1 ] 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Ml III III II II Mill MM 

Db 1107 ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCG 1166 

Qy 670 TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 72 9 

I II llllllll lllllll HIM I I I I I E I ) I I I II Mill II lllllll 

Db 1167 TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGGAAGTGGACATCAACGGGT 122 6 

Qy 73 0 TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 78 9 

III I I llllllll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1! I i 1 1 1 M 1 1 1 1 1 1 1 1 1 MUM 

Db 122 7 TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGG 128 6 

Qy 7 90 TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCAGCTGCACAGCTCCCGGCACCGCCGAG 84 9 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I Mill I III MM III I II III ' 

Db 1287 TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 1345 

Qy 8 50 CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 9 09 

llllllll llllllll MM 

Db 134 6 GCCCTGGACACCAACTATTGCT 1367 

Qy 910 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 969 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I llllllllllllll lllllll 

Db 13 68 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACC 1427 

Qy 97 0 TGGGCTGGAAGTGGATTCATGAACCC^GGGCTACCATGCCAATTTCTGCCTGGGGCCCT 102 9 

I llllllllllllll II II llllllllllllllllllll llllllll lllllll 

Db 14 28 TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCT 14 87 

Qy 103 0 GTCCCTACATCTGGAGCCTAGAC^CTCAGTACAGCAAGGTCCTGGCTCTGTAO^CCAGC 108 9 

I I'M llllllll Mill 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 lllllllllllll 

Db 14 88 GCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGC 154 7 

Qy 10 90 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 9 

I lllllllllll IM 1 1 1 II I M 1 1 II I II I II Mill 1 1 I M MM II lllllll 

Db 154 8 ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCA 1607 

Qy 1150 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 120 9 

I M 1 1 1 1 1 III M 1 1 1 1 1 1 1 1 1 1 II 1 1 M 1 1 1 II IMIII II Ml M II I M I II I I 

Db 16 08 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCT 1667 

Qy 1210 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCC^CAGCCCCGCCCACCCGGCAGGCCCGGC 126 9 

I I M 1 1 II I II I ! I II 1 1 II II II II I II I II I II I I I Mill I 

Db 1668 CCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCCCAC 1727 

Qy 1270 CCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACATCGTGCCCCAAG 1321 



Ill 1 1 1 1 III II II lllllllllllllllllll MINIM I 

Db 172 8 CCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCGTGCCCCCAA 178 7 

Qy 1322 CCC 1324 

II 

Db 1788 GCC 1790 



RESULT 13 


AAN81084 


ID 


AAN81084 standard; cDNA; 1560 BP. 


XX 




AC 


AAN81084 ; 


XX 




DT 


2 5 -MAR- 2 0 03 (updated) 


DT 


09-OCT-1990 (first entry) 


XX 




DE 


Coding sequence of simian transforming growth factor-beta 1. 


XX 




KW 


Transforming growth factor-beta 1; tumour treatment; ss cDNA. 


XX 




OS 


Cercopithecus aethiops. 


XX 




FH 


Key Location/Qualifiers 


FT 


CDS 261 . . 1433 


FT 


/*tag= a 


FT 


sig peptide 282. .323 


FT 


/*tag= b 


FT 


mat peptide 1095.. 1433 


FT 


/*tag= c 


XX 




PN 


EP293785-A. 


XX 




PD 


07-DEC-1988 . 


XX 




PF 


27 -MAY- 198 8 ; 88 EP -0108528. 


XX 




PR 


2 9 -MAY- 1987; 87US- 0055662 . 


PR 


25 -JAN- 1 988 ; 8 8US- 014 784 2 . 


XX 




PA 


(ONCO ) ONCOGEN. 


PA 


(BRIM ) BRISTOL-MYERS CO. 


XX 




PI 


Purchio AG, Gentry L, Twardzik D; 


XX 




DR 


WPI ; 1988-347488/49 . 


DR 


P-PSDB; AAP80647. 


XX 




PT 


Prodn. of simian transforming growth factor beta-1 - by culturmg 


PT 


transfected eucaryotic cells, and new precursor proteins, useful for 


PT 


treating tumours. 


XX 




PS 


Disclosure; Page ?; pp; English. 


XX 




cc 


The cDNA is prepd. from African green monkey cell line BSC-40 and is 


cc 


expressed in eukaryotic cells in plasmid pSV2 . There is 100% homology 


cc 


between mature simian and human TGF-beta 1. The plasmid also contains 



CC the SV40 promoter and a selection marker, esp . DHFR . 

CC (Updated on 25-MAR-2003 to correct PA field.) 

CC (Updated on 25-MAR-2003 to correct PI field.) 
XX 

SQ Sequence 1560 BP; 301 A; 547 C; 445 G; 267 T; 0 other; 

Query Match 72.6%; Score 982.8; DB 9; Length 1560; 

Best Local Similarity 85.8%; Pred. No. l.le-191; 

Matches 1146; Conservative 0; Mismatches 142; Indels 48; Gaps 

GGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTC 
I llll II lllllllllll Mlllllllllll I I I I I I ' 1 I 1 1 II III 



CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 134 

IIIIMIII lllllllllllll Mill llllllllllllll IMIIMI llllll 

CTGACGCCTAGCCGGCCGGCCGCAGGACTATCCACCTGCAAGACTATCGACATCGAGCTG 3 8 0 
GTGAAG CGGAAG CG CATCG AGG C CATT CG CGG C CAG ATT CTGTCCAAG CTT CGG CTTG C C 194 

1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 II 1 1 Mlllllllll lllllllllll Mill III 

GTGAAG CGGAAG CG CATCG AG A C CAT C CG CGG C CAG ATC CTGTCCAAG CTG CGG CT CG C C 4 4 0 
195 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 2 54 

INN IMIIMI MM Illlllllllllllllllll Mill II II II II 

4 4 i AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 500 
TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 314 

IIIIIMI llllllllllllll IMIIMI MM II lllllllllll II III 

TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCGGAGCCGGAGCCCGAACCGGAG 560 
GCGGA CTACTA CG CCAAGGAGGT CAC CCG CGTG CTAATGGTGGAAAG CGG CAACCAAATC 374 

II IIIIIIIIIIIIIIMIIIIIIIIIMIIIIIIIIIIIIIIM I llll Mill 

GCCGACTACTACGCC7^GGAGGTC^CCCGCGTGCTAATGGTGGAAACCCA(^ACGAAATC 62 0 
TATGATAAATTCAAGGGC^CCCCCO^CAGCTTATATATGCTGTTCAACACGTCGGAGCTC 434 

Mill II llllll I I I llllll IIIIIMI I IIIIIMI II MIMI 

TATGACAAGTTCAAGCAGAGCACACACAGCATATATATC 68 0 

CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 4 94 

II Mill II II IIIIIMI IMIIMI IMIIIIIIIIIII II II MMI 

CGAGAAGCAGTACCTGAACCTGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 74 0 
AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 554 

Mlllllllll IMIIMI IIIIIMI M 1 1 1 1 1 1 IMIIMI I MM 

AAGTTAAAAG T CG AG CAG CATG TGG AG CTGTA C CAG AAATA CAG CAA CAA T T C CTGG CGA 8 00 
TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 614 

Illllllllllllllllllllll Mill MM IIIIMIII MM IIIIMIII 

TACCT CAG CAAC CGG CTG CTGG CGCC CAG CAACTCGCCGGAGTGGTTGTCTTTTGATGTC 860 
ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 674 

MMMMMMMMIMM III llll III II II MM IIIIIMI II 

ACCGGAGTTGTGCGGCAGTGGTTGAGCCGCGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 92 0 
GCCC^CTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGG 734 

IIIIIMI M M I Ml 1 1 M I M 1 1 1 1 1 1 II 1 1 M MM II llllllllll I 

GCCCACTGCTCCTGTGACAGCAAAGATAAC^CACTGC^GTGGACATCAACGGGTTCACT 98 0 



Qy 


15 


Db 


261 


Qy 


75 


Db 


321 


Qy 


135 


Db 


381 


Qy 


195 


Db 


441 


Qy 


255 


Db 


501 


Qy 


315 


Db 


561 


Qy 


375 


Db 


621 


Qy 


/IOC 

435 


Db 


681 


Qy 


495 


Db 


741 


Qy 


555 


Db 


801 


Qy 


615 


Db 


861 


Qy 


675 


Db 


921 



Qy 735 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 794 

I IIMIIII llllllllllllll Mill 1 1 1 1 1 1 E 1 1 1 1 1 1 1 IIMIIII III 

Db 981 ACCGGCCGCCGAGGTGACCTGGCCACAATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 104 0 

Qy 7 95 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 854 

1 1 1 1 1 1 1 1 ! M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II Mill 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1041 ATGG CCA C C CCG CTGGAG AGGG C C CAA CAT CTG CAAAG CT C C CGG CA C CG CCGA 10 94 

Qy 8 55 GATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGC 914 

IIMIIII I I I I I I I I I II I I I I I I I 
Db 10 95 GCCCTGGACACCAACTACTGCTTCAGC 1121 

Qy 915 TCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGC 974 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 IIMIIII Ml 

Db 1122 TCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTATATTGACTTCCGCAAGGACCTCGGC 1181 

Qy 97 5 TGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCC 1034 

lllllllllll II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II M 1 1 1 1 1 II 1 1 1 1 1 1 1 

Db 1182 TGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTGGGGCCCTGTCCC 1241 

Qy 103 5 TACT^TCTGGAGCCTAGACACTC^GTACAGC 10 94 

Mill MM Mill E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 MM I Ml M Ml 

Db 1242 TACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAAC 1301 

Qy 10 95 CCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTG 1154 

IIMIIII MMMMIMI II II I III I III III MIMMMI I II MMMM I I 

Db 13 02 CCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTG 13 61 

Qy 1155 TACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGC 1214 

' i 1 1 1 1 1 1 1 ill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 J 1 1 ! 1 1 1 1 1 1 1 1 ]] I ! Mill 

Db 13 62 TACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGA 1421 

Qy 1215 AAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGCCCCAC 1274 

II MM M I Ml I II I M 1 1 IM 1 1 1 II I II I I I Mill II III 

Db 1422 AAATGCAGCTGAGGCCCCGCCCCGCCCCGCCCCACCCCGGCAGGCCCGGCCCCGCCCCAC 14 81 
Qy 1275 CCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACA - TCGTGCCCCAAGCCCA 132 5 

III III II I 1 1 1 1 M I M M 1 1 II 1 1 1 , 1 1 1 1 1 M 1 1 1 1 1 1 M 

Db 14 82 CCCACCCCCGCTGTCTTGCCCTTGGGGGCTGTATTTAAGGACACCCGTGCCCCAAGCCCA 1541 

Qy 132 6 CTTGGGATCGATTAAA 1341 

I MM I I II 

Db 1542 CCTGGGGCCCCATTAA 1557 

RESULT 14 
AAQ03508 

ID AAQ03508 standard; DNA; 1560 BP. 
XX 

AC AAQ03508; 
XX 

DT 25-MAR-2003 (updated) 
DT 09-JAN-2003 (updated) 
DT 14-AUG-1990 (first entry) 
XX 

DE Simian Transforming growth factor - Betal. 



XX 

KW HIV; AIDS ; SIV; vaccine; AZT; CD 4 ; cytokines; growth 

KW factors; ds . 

XX 

OS Cebus apella. 
XX 

FH Key Location/Qualifiers 

FT CDS 267 . . 1437 

FT /*tag= a 

FT mat_peptide 1103.. 1437 

FT /*tag= b 

XX 

PN EP356935-A. 
XX 

PD 07-MAR-1990. 
XX 

PF 25-AUG-1989; 8 9EP - 0 1 157 1 9 . 
XX 

PR 25-AUG-1988; 88US- 0236698 . 
XX 

PA (ONCO ) ONCOGEN LP. 
XX 

PI Brankovan V, Lioubin M, Purchio A; 
XX 

DR WPI; 1990-068723/10. 

DR P-PSDB; AAR05663. 
XX 

PT Compsns . contg . transforming growth factor beta - 

PT used for inhibitions of HIV infection and replication in vivo. 

XX 

PS Disclosure; Fig 1; 20pp; English. 
XX 

CC TGF-beta may be used in vivo to prevent formation of synctia and 

CC inhibit HIV infection. TGF may also be used with other HIV treatments 

CC (AZT, soluble CD 4 etc.). 

CC (Updated on 09-JAN-2003 to add missing OS field.) 

CC (Updated on 25-MAR-2003 to correct PA field.) 
XX 

SQ Sequence 1560 BP; 301 A; 547 C; 445 G; 267 T; 0 other; 

Query Match 72.6%; Score 982.8; DB 11; Length 156 0; 
Best Local Similarity 85.8%; Pred. No. l.le-191; 

Matches 1146; Conservative 0; Mismatches 142; Indels 48; Gaps 3; 

Qy 15 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 74 

Ml MM M MMMMMI MMMMMMI MMIMMMMI M Ml 

Db 2 61 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 32 0 

Qy 75 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 134 

IIIIIIIII MMMMMMI lllll MMMMMMI 1 1 M II 1 1 IMMI 

Db 321 CTGACGCCTAGCCGGCCGGCCGCAGGACTATCCACCTGCAAGACTATCGACATCGAGCTG 38 0 

Qy 13 5 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 194 

MIIIIIIMIMIIIIIIII MM MMMMMI II I I MMI Ml 

Db 381 GTGAAGCGGAAGCGCATCGAGACCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 44 0 



Qy 



195 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 254 



Db 


441 


Qy 


255 


Db 


501 


Qy 


315 


Db 


561 


Qy 


375 


Db 


621 


Qy 


435 


Db 


681 


Qy 


495 


Db 


741 


Qy 


555 


Db 


801 


Qy 


615 


Db 


861 


Qy 


675 


Db 


921 


Qy 


735 


Db 


981 


Qy 


795 


Db 


1041 


Qy 


855 


Db 


1095 


Qy 


915 


Db 


1122 


Qy 


975 


Db 


1182 


Qy 


1035 



llllllllllllllllllll llllllllllllllllllll Mill II II II II 

AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 500 

TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 314 

llllllll llllllllllllll llllllll II I I II IIIIIIIIII II III 
TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCGGAGCCGGAGCCCGAACCGGAG 560 

GCGGAGTACTACGCGAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGGAACCAAATC 374 

II IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIII I 1 1 1 1 Mill 

GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 62 0 

TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 4 34 

Mill II llllll I I I llllll llllllll I llllllll II llllll 
TATGACAAGTTCAAGCAGAGCACACACAGCATATATATGTTCTTCAACACATCAGAGCTC 68 0 

CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 4 94 

M Mill II II MMMM llllllll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CGAGAAGCAGTACCTGAACCTGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 74 0 
AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 554 

IIIIIIIIII MMMM llllllll IIIIIIIIII II I IIIIIIIIII 

AAGTTAAAAGTCGAGCAGCATGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 800 
TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 614 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill MM MIIIIIM MM lllllllll 

TACCTCAGCAACCGGCTGCTGGCGCCCAGCAACTCGCCGGAGTGGTTGTCTTTTGATGTC 8 60 
ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 674 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M I III MM III II II Mill llllllll II 

ACCGGAGTTGTGCGGCAGTGGTTGAGCCGCGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 92 0 
GCCCAGTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 734 

MMMM I M I M I M I M M 1 1 M I M M M I II Mill II IMMMMI I 

GCCCACTGGTCCTGTGACAGCAAAGATAACACACTGCAAGTGGACATCAACGGGTTCACT 980 
TCTGGCCGCCGGGGTGACCTGGGCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 7 94 

I MMMM MIIIIIM MM Mill MMMMMMM llllllll III 

ACCGGCCGCCGAGGTGACCTGGCCACAATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 104 0 



1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 II Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



GATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGC 914 

llllllll llllllllllllllllll 
GCCGTGGACACCAACTACTGCTTCAGC 1121 

TCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGC 974 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 [ 1 1 1 1 M IMMMMI llllllll III 

TCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTATATTGACTTCCGCAAGGACCTCGGC 1181 
TGGAAGTGGATTCATGAACCCAAGGGGTACCATGCCAATTTCTGCCTGGGGCCCTGTCCC 1034 

1 1 1 i 1 1 ! 1 1 1 1 II M llllllllllllllllllll MMMM I M 1 1 1 1 1 1 1 1 1 

TGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAAGTTCTGCCTGGGGCCCTGTCCC 1241 
TAC^TCTGGAGGCTAGACACTCAGTAC^GC^GGTCCTGGCTCTGTACAACCAGCACAAC 1094 

Mill MMMM II I II llllllllllllllllllll MMMMMMM III 



Db 



124 2 TACATTTGGAGCCTGGAC^CGC^GTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAAC 13 01 



Qy 10 95 CCGGG CGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTG 1154 

Illlllll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i I II I! 1 1 1 1 1 1 1 1 III I i II 1 1 M I M . I M M 

Db 13 02 CCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTG 1361 

Qy 1155 TACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGC 1214 

I I I I I I I I II I II I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
Db 13 62 TACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGA 1421 

Qy 1215 AAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGCCCCAC 1274 

I: I llllllllllllllllllllll III II I I I Mill MM III 

Db 1422 AAATGCAGCTGAGGCCCCGCCCCGCCCCGCCCCACCCCGGCAGGCCCGGCCCCGCCCCAC 1481 

Qy 127 5 CCCCGCCCGCCT CACCGGGG CTGTATTTAAGGACA - T CGTG CC C CAAG C C CA 132 5 

III III II I I I I I I I I I I M I I II I I I I I I I I II II II I I I II 

Db 14 82 CCCACCCCCGCTGTCTTGCCCTTGGGGGCTGTATTTAAGGACACCCGTGCCCCAAGCCCA 1541 

Qy 1326 CTTGGGATCGATTAAA 1341 

Mill I III 
Db 154 2 CCTGGGGCCCCATTAA 1557 

RESULT 15 
AAT05876 

ID AAT05876 standard; cDNA ; 2745 BP. 
XX 

AC AAT05876; 
XX 

DT 25-JUN-1996 (first entry) 
XX 

DE cDNA encoding transforming growth factor-beta 1. 
XX 

KW macrophage inducible nitric oxide synthase; iNOS; constitutive NOS; 

KW mterleukin-l-beta; transforming growth factor-beta; TGF-beta; ILl-beta; 

KW nitric oxide production; hypotension; inflammation; septic shock; 

KW treatment; ds . 

XX 

OS Mammalian sp . 
XX 

FH Key Location/Qualifiers 
FT CDS 842.. 2017 

FT /*tag= a 

FT /product= transforming growth factor-beta 1 

XX 

PN W09526745-A1 . 
XX 

PD 12-OCT-1995. 
XX 

PF 05-APR-1994; 94WO-US03 7 05 . 
XX 

PR 05-APR-1994; 94WO-US03 7 05 . 
XX 

PA (HARD ) HARVARD COLLEGE. 
XX 

PI Lee M, Perrella MA; 
XX 



DR WPI; 1995-358443/46. 

DR P-PSDB; AAR83054. 
XX 

PT Treatment of hypotension, esp. in septic shock - by administering 

PT transforming growth factor-beta e.g. to inhibit inducible nitric 

PT oxide synthase gene transcription 
XX 

PS Disclosure; Fig 15; 52pp; English. 
XX 

CC The cDNA encodes transforming growth factor-beta 1 (TGF-beta 1) which 

CC has been found to inhibit inducible nitric oxide synthase (iNOS) gene 

CC transcription, esp. in interleukin- 1 -beta (ILl-beta) stimulated rat 

CC smooth muscle cells, and at a dose which does not inhibit constitutive 

CC NOS. TGF-beta 1 or 2 (AAR83055) or their active fragments, can be used 

CC in the treatment of hypotension, such as that associated with severe 

CC inflammation or septic shock. 
XX 

SQ Sequence 2745 BP; 527 A; 938 C; 801 G; 479 T; 0 other; 

Query Match 72.5%; Score 981; DB 16; Length 2745; 

Best Local Similarity 85.5%; Pred . No. 2.7e-191; 

Matches 1148; Conservative 0; Mismatches 145; Indels 50; Gaps 3; 

CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 69 

II Ml 1 1 1 1 II lllllllllll IIIIIIMIMM I I 

CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 8 96 
TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 9 

I MIIIMMMIMM MINIMI Mill . 1 1 1 1 1 1 1 1 : 1 1 1 1 MIMIMII 

TGGTGCTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 956 
AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 9 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I i It 1 1 1 1 1 1 1 lllllllllll MM 

AG CTGGTGAAG CGGAAG CG CAT CGAGG C CAT CCG CGG C CAGATC CTGT C CAAG CTG CGGC 1016 
TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 9 

I MIMIMMIMIMMIMM 1 1 1 , 1 1 1 ' 1 1 . ' I ' i 1 1 1 1 Mill II II I 

TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 1076 
CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 09 

I II IIIIMM I i 1 1 Ml 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 

CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 1136 
CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 369 

I Mill 1 . 1 1 1 1 > 1 1 1 1 1 1 ! I ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 i 1 1 I MM 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 1196 
AAATCTATGATAAATTCAAGGGCACCCCC^CAGCTTATATATGCTGTTCAAC^VCGTCGG 429 

1 1 1 1 1 1 1 1 1 1 M MUM I I llllll IIIIMM I IIIIMM II i 

AAATCTATGACAAGTTCAAGCAGAGTACACAC^ 1256 
AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCT- - 4 87 

M M II I IIIIMM II Mill II IIIIMM II M 1 1 II M I II I Mill 

AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 1316 



Qy 


10 


Db 


837 


Qy 


70 


Db 


897 


Qy 


130 


Db 


957 


Qy 


190 


Db 


1017 


Qy 


250 


Db 


1077 


Qy 


310 


Db 


1137 


Qy 


370 


Db 


1197 


Qy 


430 


Db 


1257 


Qy 


488 



546 



llllllllll lllllllllll MM! Mill! Ill i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Ml 



Db 


1317 


Qy 


547 


Db 


1377 


Qy 


607 


Db 


1437 


Qy 


667 


Db 


1497 


Qy 


727 


Db 


1557 


Qy 


787 


Db 


1617 


Qy 


847 


Db 


1677 


Qy 


907 


Db 


1698 


Qy 


967 


Db 


1758 


Qy 


1027 


Db 


1818 


Qy 


1087 


Db 


1878 


Qy 


1147 


Db 


1938 


Qy 


1207 


Db 


1998 


Qy 


1267 


Db 


2058 


Qy 


1319 


Db 


2118 



1317 GGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATT 1376 
TGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCT 606 

Mill II 1 1 1 1 1 1 1 1 1 . 1 M Mill I Mill Mill II MUM I II I 



TTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTC 666 

1 1 1 1 1 1 1 1 1 1 1 1 1 It 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Ml Ml III II II Mill MM 

TTGATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTC 14 96 
GCCTCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACG 726 

MM II Mllllll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 It I II Mill II MM 

GCCTTAGCGCCC^CTGCTCCTGTGACAGCAGGGATAACACACTGC^GTGGACATCAACG 1556 
GGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCC 786 

MUM I I Mllllll llllllllllllllllllll IIIIIIMIIMM MM 

GGTTCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCC 1616 
TGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCC 84 6 

1 1 1 1 1 1 1 1 1 II I II I M 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 Mill IIIIIIMIIMM 

TGCTTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCC 1676 



M MMMM Mllllll I 

GA GCCCTGGACACCAACTATT 1697 

GCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGG 966 

'I'll llllllllllllll MM 

GCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGG 1757 
ACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGC 1026 

MM llllllllllllll II II llllllllllllllllllll Mllllll MM 

ACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGC 1817 
CCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACC 108 6 

MM Mllllll Mllllll Mill llllllllllllllllllll llllllllll 

CCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACC 1877 
AGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGC 114 6 

MM I Ml III 1 1 1 1 1 1 ' I M , 1 M 1 1 1 1 1 1 ! M I I M 1 1 : 11 1 M ! MM 

AGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGC 1937 
CC^TCGTGTACTACGTGGGCCGCAAGCCC^AGGTGGAGCAGCTGTCCAACATGATCGTGC 1206 

'I I , I I , M I, II I 

CCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGC 1997 
GTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCC 1266 

I I M M M 1 1 1 1 1 M 1 1 1 1 1 llllllllllll II I II I I I Mill 

GCTCCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCC 2 057 
GGCCCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACATCGTGCCCC 1318 

MM MM III II M MIIIMII lllilllll MMMM 

CACCCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCGTGCCCC 2117 
AAGCCCACTTGGGATCGATTAAA 1341 

Mllllll MM I I II 

AAGCCCACCTGGGGCCCCATTAA 214 0 



Search completed: October 27, 2003, 19:11:23 
Job time : 416.484 sees 

GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on : 



October 27, 2003, 18:35:27 ; Search time 402.152 Seconds 

(without alignments) 
9022.658 Million cell updates/sec 



Title: 

Perfect score: 
Sequence: 

Scoring table: 



Searched: 



US-10-017-372E-34 
1353 

1 gatctggtaccgagatggcg cgattaaagcggccgcgact 1353 

IDENT1TY_NUC 

Gapop 10.0 , Gapext 1.0 

1792395 seqs, 1340900451 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



3584790 



Post -processing : 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 



Database : Published_Applications_NA: * 

1 : /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB.seq: * 

2 : /cgn2_6/ptodata/2/pubpna/PCT_NEW_PUB. seq: * 

3 : /cgn2_6/ptodata/2/pubpna/US06_NEW_PUB. seq: * 

4 : /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB. seq: * 

5 : /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB. seq: * 

6 : /cgn2_6/ptodata/2/pubpna/PCTUS_PUBCOMB . seq : * 

7 : /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB . seq : * 

8 : /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB . seq : * 

9 : /cgn2_6/ptodata/2/pubpna/US0 9A_PUBCOMB . seq: * 
10: /cgn2_6/ptodata/2/pubpna/US09B_PUBCOMB. seq: * 
11 : /cgn2_6/ptodata/2/pubpna/US09C_PUBCOMB. seq: * 
12 : /cgn2_6/ptodata/2/pubpna/US09__NEW_PUB. seq: * 
13 : /cgn2_6/ptodata/2/pubpna/US10A_PUBCOMB. seq: * 
14 : /cgn2_6/ptodata/2/pubpna/US10B_PUBCOMB. seq: * 
15 : /cgn2_6/ptodata/2/pubpna/US10_NEW_PUB.seq: * 
16 : /cgn2_6/ptodata/2/pubpna/US60_NEW_PUB.seq: * 
17 : /cgn2_6/ptodata/2/pubpna/US60_PUBCOMB.seq: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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RESULT 1 
US-10-087-268-1 

; Sequence 1, Application US/10087268 



Publication No. US200301 19010A1 
GENERAL INFORMATION: 
APPLICANT: Jonsonn, Julie Ruth 
APPLICANT: Powell, Elizabeth Ellen 

TITLE OF INVENTION: Polypeptides and polynucleotides linked to a disease or 
condition 

FILE REFERENCE: Fibrosis 

CURRENT APPLICATION NUMBER: US/ 10/ 087 , 268 
CURRENT FILING DATE: 2002-03-01 
NUMBER OF SEQ ID NOS : 6 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 1 
LENGTH: 1821 
TYPE: DNA 
ORGANISM: Human 
FEATURE : 
NAME /KEY: 5 1 UTR 
LOCATION: (1) . . (511) 
OTHER INFORMATION: 
NAME/ KEY: CDS 
LOCATION: (512) . . (1684) 
OTHER INFORMATION: 
NAME/ KEY : s ig_pept ide 
LOCATION: (512) . . (598) 
OTHER INFORMATION: 
NAME /KEY: 3 1 UTR 
LOCATION: (1685) . . (1821) 
OTHER INFORMATION: 
US-10-087-268-1 

Query Match 72.8%; Score 985; DB 14; Length 1821; 

Best Local Similarity 85.9%; Pred. No. 5.9e-255; 

Matches 1136; Conservative 0; Mismatches 140; Indels 47; Gaps 2; 

Qy 10 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 69 

II III I I I I II lllllllllll lllllllllllll llllllllllllll I 
Db 5 07 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 566 

Qy 70 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 9 

I IIIIIIIIIIIIIIIIIIIIIIIIII Mill llllllllllllll IIIIIIIIII 
Db 567 TGGTGCTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 62 6 

Qy 13 0 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 9 

llllllllllllllllll II Mlllilll lllllllllll lllllllllll 1 1 1 1 

Db 627 AG CTGGTG AAG CGGAAG CG CAT CG AGG C CAT CCG CGG CCAGAT C CTGTC CAAG CTG CGG C 68 6 

Qy 190 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 9 

I lllllllllllllllllllllll llllllllllllllllllll Mill II II I 
Db 687 TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 74 6 

Qy 2 50 CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 09 

Ml llllllll I lllllllllll M llllllll MM MM III MM Mill 

Db 74 7 CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 8 06 

Qy 310 CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 36 9 

I Mill Mil II llllllll II 11 1 M ; 1 1 1 ! M 1 1 1 1 1 1 I MM 

Db 8 07 CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 8 66 



Qy 


370 


Db 


867 


Qy 


430 


Db 


927 


Qy 


490 


Db 


987 


Qy 


550 


Db 


1047 


Qy 


610 


Db 


1107 


Qy 


670 


Db 


1167 


Qy 


730 


Db 


1227 


Qy 


790 


Db 


1287 


Qy 


850 


Db 


1346 


Qy 


910 


Db 


1368 


Qy 


970 


Db 


1428 


Qy 


1030 


Db 


1488 


Qy 


1090 


Db 


1548 


Qy 


1150 


Db 


1608 



AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 42 9 

1 1 1 i 1 1 : 1 1 1 II 1 1 1 1 1 1 I I 1 1 1 1 1 1 MINIM I MINIM II I 

AAATCTATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAG 92 6 
AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 8 9 

IIIIIII MINIM II IMM M lllllli: IMIIIIIMMII IIIIMI 

AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 986 
GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 9 

1 1 1 1 1 1 1 1 1 ! 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 i 1 1 MINI 

GG CT CAAGTTAAAAGTGG AG CAG CACGTGGAG CTGTA C CAGAAATA CAG CAACAATT C CT 104 6 
GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 60 9 

Nil 1 1 1 1 1 M 1 1 II ,11 , 1 1 1 1 M I IMM Mill M MINI I M MM 

GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 1106 
ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 669 

1 1 1 ' 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 ! 1 1 1 1 1 1 Ml III III M II Mill IIIIIII 

ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 1166 
TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 729 

I II MINIM MIMIIMMM MINIUM II IMM II IIIIIII 

TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGT 1226 
TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 78 9 

III I I MINIM NNNNINNNNIN 1 1 i 1 1 1 1 1 ! 1 1 1 1 1 IIIIIII 

TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 12 86 
TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 9 

I N 1 1 1 1 1 1 N 1 1 M I N 1 1 1 IN 1 1 III 1 1 1 INN 1 1 1 1 1 1 1 1 1 1 1 1 ' 1 1 II . 

TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA - 134 5 
CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 9 09 

IIIIIII: NN! MM 

GCCCTGGACACCAACTATTGCT 1367 



TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 96 9 

I III 1 1 1 N II INI 1 1 II I II 1 1 1 1 III II I N II IN INN IIIIIII 

TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACC 1427 
TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 102 9 

I lllllllllllill II II 1 1 IN I II N I ill 1 1 1 1 II NIIIMI IIIIIII 

TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCT 1487 
GTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 108 9 

I IIIIIII Nli Mill Ml 1 1 . 1 1 1 1 1 1 M I M I II ININ I 

GCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGC 154 7 
ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 9 

I 1 1 1 Ml II INI M 1 1 MINN III III ill III III INI II IIIIIII 

ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCA 1607 
TCGTGTAGTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 1209 

1 1 1 III II II 1 1 II II I II 1 1 II 1 1 III II II I Nl I N III 1 1 1 1 N I II 1 1 N II I 

TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCT 1667 



Qy 1210 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 1269 

1 1 1 1 1 1! 1 1 1 1 1 1 h 1 1 1 MI NI II I II I I I Mill I 

Db 1668 CCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCCCAC 1727 

Qy 127 0 CCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACATCGTGCCCCAAG 1321 

III I I I I III II II I II I III MlllillUI IMIIIII I 

Db 1728 CCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCGTGCCCCCAA 1787 

Qy 1322 CCC 1324 

I I 

Db 1788 GCC 1790 



RESULT 2 

US-10-037-270-220 

Sequence 220, Application US/10037270 
Publication No. US20030104529A1 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Tang , Y . Tom 
Liu, Chenghua 
Asundi, Vinod 
Zhang, Jie 
Ren, Feiyan 
Chen , Ru i - hong 
Zhao, Qing A. 
Wehrman, Tom 
Xue, Aidong J. 
Yang , Yonghong 
Wang, Jian-Rui 
Zhou, Ping 
Ma, Yunqing 
Wang, Dunrui 
Wang, Zhiwei 
Tillinghast , 



John 

Drmanac, Radoje T. 
TITLE OF INVENTION: No. US2003 01 04529Alel Nucleic Acids and 
TITLE OF INVENTION: Polypeptides 
FILE REFERENCE: 784CIP2B 

CURRENT APPLICATION NUMBER: US/10/037,27 0 
CURRENT FILING DATE: 2002-01-04 
PRIOR APPLICATION NUMBER: 09/552,317 
PRIOR FILING DATE: 2000-04-25 
PRIOR APPLICATION NUMBER: 09/488,725 
PRIOR FILING DATE: 2000-01-21 
NUMBER OF SEQ ID NOS : 1104 
SOFTWARE: pt_FL_genes Version 1.0 
SEQ ID NO 220 
LENGTH: 2 742 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE : 
NAME/ KEY: CDS 
LOCATION: (842) . . (2014) 
US-10-037-270-220 



Query Match 72.8%; Score 984.6; DB 14; Length 2742; 

Best Local Similarity 85.7%; Pred. No. 8.4e-255; 



Matches 1149; Conservative 0; Mismatches 144; indels 48; Gaps 3; 



Qy 


10 


Db 


837 


Qy 


70 


Db 


897 


Qy 


130 


Db 


957 


Qy 


190 


Db 


1017 


Qy 


250 


Db 


1077 


Qy 


310 


Db 


1137 


Qy 


370 


Db 


1197 


Qy 


430 


Db 


1257 


Qy 


490 


Db 


1317 


Qy 


550 


Db 


1377 


Qy 


610 


Db 


1437 


Qy 


670 


Db 


1497 


Qy 


730 


Db 


1557 


Qy 


790 


Db 


1617 



i i n . i ii 1 1 1 1 1 1 1 1 1! ii i r 1 1 1 him mini i iu Milium 

TGGTGCTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 
AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 

llllllllilll! I Mllllilllllll! IMIIIIIIII IIMIIIIIII II 1 1 

AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 
TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 

I IMIMIMMIIIMIIIIMI M M I i II II 11 1 II M I M II II I II II I 



CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 0 9 

I II IIIIIMI M I I Mill I Mil 1 1 1 1 1 1 1 r 1 1 ! 1 1 ! 1 1 

CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 1136 
CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 3 6 9 

I Mill M 1 1 1 1 1 'I II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 ! I III 1 1 1 1 1 M 1 1 I MM 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 1196 
AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 42 9 

MMMMII II MMM I I I II I II I II M II I I MINIM II I 

AAATCTATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAG 1256 
AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 8 9 

TIM IIIIIMI M Mill II IIIIIMI 1 III 1 1 1 1 1 1 1 1 ll MINI 

AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 1316 
GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 9 

1 1 1 1 1! M II 1 1 1 1 1 1 1 Ml 1 1 1 1 M M I II M I 1 1 II III 1 1 1 1 1 1 M II MMM 

GGCTCAAGTTAAAAGTGGAGCAGC^CGTGGAGCTGTACCAGAAATAGAGCAACAATTCCT 13 76 



Ml MMMMMMMMMIMM Mill Mill II MMM I M MM 



ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 669 

1 1 1 1 1 1 M III 1 1 II II Ml 1 1 1 1 i I Ml Ml III II II Mill MMM 

ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 14 96 
TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 72 9 

I II IIIIIMI MIMMMMM 1 1 1 1 1 1 1 1 1 f I II Mill M MMM! 

TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGT 1556 
TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGGVTGAACCGGCCCTTCCTGC 78 9 

III I I MMM MMMIMMMMIMM 1 1 1 1 1! 1 1 1 1 II 1 1 MIMM 

TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 1616 
TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 9 

I I II 1 1 II M 1 1 II II 1 1 II II I M M 1 1 II 1 1 Mill MMMMIMMIMM 

TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 167 5 



Qy 8 50 CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 909 

MINIM 1 1 1 1 1 1 1 1 II 1 1 

Db 1676 G C CCTGG ACAC CAACT ATTG CT 1697 

Qy 910 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 969 

MMMMMMMMMMMMMMMMMMI MMMMMMM MMMI 

Db 16 98 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACC 1757 

Qy 970 TGGG CTGGAAGTGGATT CATGAAC C CAAGGG CTAC CATG C CAATTT CTG CCTGGGG CC CT 1029 

I MMMMMMM 1 1 M MMMMMMMMMM MMMM MMMI 

Db 17 58 TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCT 1817 

Qy 1030 GTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 1089 

I MMMM MMMM Mill I II II 1 1 II II II 1 1 1 1 M I IMIMIMIMI 

Db 1818 GCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGC 18 77 

Qy 10 90 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 9 

I MMMIMM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MMMI 

Db 18 78 ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCA 1937 

Qy 115 0 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 12 09 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1938 T CGTGT A CTACGTGGG C CG CAAG C C CAAGGTGG AG CAG CTGT C CAA CATGAT CGTG CGCT 1997 

Qy 1210 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 1269 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I II I I I Mill I 

Db 19 98 CCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCCCAC 2 057 
Qy 127 0 CCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACA - TCGTGCCCCAA 132 0 

Ml MM III II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IMMMMI 

Db 2058 CCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCCGTGCCCCAA 2117 

Qy 1321 GCCCACTTGGGATCGATTAAA 1341 

MMM MM I I II 

Db 2118 GCCCACCTGGGGCCCCATTAA 213 8 



RESULT 3 
US-10-087-268-4 

; Sequence 4, Application US/10087268 

; Publication No. US20030119010A1 

; GENERAL INFORMATION: 

; APPLICANT: Jonsonn, Julie Ruth 

; APPLICANT: Powell, Elizabeth Ellen 

TITLE OF INVENTION: Polypeptides and polynucleotides linked to a disease or 
condition 

; FILE REFERENCE: Fibrosis 

; CURRENT APPLICATION NUMBER: US/10/087 , 268 
; CURRENT FILING DATE: 2002-03-01 
; NUMBER OF SEQ ID NOS : 6 

SOFTWARE: Patentln version 3.1 
; SEQ ID NO 4 

LENGTH: 1821 

TYPE: DNA 

ORGANISM: Human 

FEATURE : 



NAME /KEY: 5 ' UTR 
LOCATION: (1) . . (511) 
OTHER INFORMATION: 
NAME /KEY: CDS 
LOCATION: (512) . . (1684) 
OTHER INFORMATION: 
NAME/ KEY : sig__peptide 
LOCATION: (512) . . (598) 
OTHER INFORMATION: 
NAME /KEY: 3 ' UTR 
LOCATION: (1685) (1821) 
OTHER INFORMATION: 
US-10-087-268-4 



Query Match 72.7%; Score 983.4; DB 14; Length 1821; 

Best Local Similarity 85.8%; Pred. No. 1.6e-254; 

Matches 1135; Conservative 0; Mismatches 141; Indels 47; Gaps 2; 



Qy 


10 


CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 


69 






II III MM II MINIUM MIMMMMM 1 1 1 1 1 1 1 1 1 1 1 II 1 1 




Db 


507 


CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 


566 


Qy 


70 


TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 


129 






1 MMIIMIIMM 1 1 1 1 1 1 1 1 1 Mill 1 .1 MMIIIMI 




uu 


DO f 


IULj ILtU 1LjALLjL.L IbbLuubuLbbULbLbbtjAL 1A 1 LtJALL 1 OLAAbAL 1AI LbACATGG 


626 


Qy 


130 


AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 


189 






MMIIMMMMMMMMMMIMM MIMMIMI MMIMIMI MM 




UD 


(TOT 

bz / 


AbL 1 bb I bAAbLbbAAbLbLA 1 CGAGGLCATCLbLGGLCAGATCCTGTCCAAGCTGCGGC 


686 


wy 


ion 


1 1 bbbA^bbbbbbbAbLbAbbbbbAbb IbLLbLLLbbLLLbb lbLL IbAbbbAbTALTbb 


24 9 






1 1 M M 1 1 1 II M M II II 1 1 II 1 MIMMMMMMIMM Mill II II 1 




Db 


687 


TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 


746 


Qy 


250 


CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 


309 






1 II IIIIIMI MMIMMIMM llllllll MM Mill 




Db 


747 


CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 


806 


Qy 


310 


CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 


369 






1 Mill 1 1 M 1 1 II 1 1 1 1 1 II 1 1 1 h 1 1 M 1 II 1 1 II II 1 II 1 1 1 1 1 1 1 Mil 




Db 


807 


CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 


866 


Qy 


370 


AAATCTATGATAAATTC^GGGCACCCCCCACAGCTTATATATGCTGTTC^ACACGTCGG 


429 






1 II 1 II M 1 1 II llllll 1 1 MIMI II 1 M II 1 1 IIIIIMI II 1 




Db 


867 


AAATCTATGACAAGTTCAAGCAGAGTACACACAGCATAT^ 


926 


Qy 


430 


AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 
MIMI IIIIIMI II Mill II llllllll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 lllllll 


489 


Db 


927 


AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 


986 


Qy 


490 


GGCT(^GTTAAAAGTGGAGCAGC^CGTGGAGCTATACC^GAAATA(^GCAATGATTCCT 


549 






1 1 1 1 1 1 1 1 1 II 1 1 1 M 1 M 1 1 1 1 II 1 1 1 1 1 M 1 1 1 1 1 1 II 1 1 1 M 1 II 1 1 1 llllll 




Db 


987 


GGCTCAAGTTAAAAGTGGAGCIAGCACGTGGAGCTGT^ 


1046 


Qy 


550 


GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 


609 


Db 




MM MIIMIMIIIIIIMIIIIII Mill Mill II MIMI 1 II MM 




1047 


GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 


1106 



Qy 610 ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 669 

MMI Mill II IMMIMII III III III II II Mill lllllll 

Db 1107 ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 1166 

Qy 67 0 TCAGTGCCCACTGTTCCTGTGACAGOW^GATAACACACTCCACGTGGAAATTAACGGGT 72 9 

I II llllllll IIIIIIIIIIIII I I I I I I I I I I I II Mill II lllllll 
Db 1167 TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGT 122 6 

Qy 73 0 TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 78 9 

mi i i Miiiiii 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 iiiiiii 

Db 1227 TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 12 8 6 

Qy 7 90 TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 9 

I I I I I I I I I I I I ! I I I I I I I i I I I I I I I I I I I I Mill I I I I I r I I I I I I 1 I I I I I 
Db 12 87 TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 134 5 

Qy 8 50 CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 90 9 

llllllll llllllll I I I I 
Db 134 6 GCCCTGGACACCAACTATTGCT 1367 

Qy 910 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 969 

1 1 II 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 E lllllll 

Db 13 68 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACC 14 27 

Qy 970 TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 102 9 

I iiiiiiiiiiiiii ii ii immmmmmmimm miiiiii iiiiiii 

Db 1428 TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCT 14 87 

Qy 103 0 GTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 108 9 

i 1 1 r 1 1 1 1 1 iiiiiiii mmi 1 1 1 1 1 1 ' 1 1 1 1 r 1 1 1 m 1 1 iiiiiiiiiiiii 

Db 14 88 GCCCCTA(^TTTGGAGCCTGGACACGCAGTAC^GC^AGGTCCTGGCCCTGTACAACCAGC 1547 

Qy 10 90 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 9 

I 1 1 1 1 M 1 1 1 1 1 II II 1 1 1 1 1 II ! 1 1 1 1 III 1 1 ! 1 1 1 1 II 1 1 II 1 1 1 1 1 1 IIMIM 

Db 154 8 ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCA 1607 

Qy 1150 TCGTGTACTACGTGGGCCGC^GCCC^GGTGGAGCAGCTGTCCAACATGATCGTGCGTT 12 09 

MIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMI I 

Db 16 08 TCGTGTACTACGTGGGCCGC^GCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCT 1667 

Qy 1210 CCTGC^GTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 12 69 

1 1 J 1 1 1 1 1 1 1 1 r j 1 1 1 e j r ii iiiiii miiiiii mmi i 

Db 1668 CCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCCCAC 1727 
Qy 1270 CCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACATCGTGCCCCAAG 1321 

Ml MM III II II I i ! 1 1 ! 1 1 1 1 1 1 . 1 1 1 1 1 MIIIIII I 

Db 1728 CCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCGTGCCCCCAA 1787 

Qy 1322 CCC 1324 

II 

Db 1788 GCC 1790 



RESULT 4 

US-09-948-002-28 

; Sequence 28, Application US/09948002 



Publication No. US20030050265A1 
GENERAL INFORMATION: 
APPLICANT: Nicholas M. Dean 
APPLICANT: Susan F. Murray 

TITLE OF INVENTION: ANTI SENSE MODULATION OF TRANSFORMING GROWTH 
TITLE OF INVENTION: FACTOR BETA EXPRESSION 
FILE REFERENCE : ISPH-0607 

CURRENT APPLICATION NUMBER: US/09/948,002 
CURRENT FILING DATE: 2000-09-05 
PRIOR APPLICATION NUMBER: 09/661,753 
PRIOR FILING DATE: 2000-09-14 
PRIOR APPLICATION NUMBER: 60/154,546 
PRIOR FILING DATE: 1999-09-17 
NUMBER OF SEQ ID NOS : 71 
SEQ ID NO 28 
LENGTH: 274 5 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE : 
NAME /KEY: CDS 
LOCATION: (842) . . . (2017) 
US-09-948-002-28 

Query Match 72.5%; Score 981; DB 11; Length 2745; 

Best Local Similarity 85.5%; Pred. No. 7.8e-254; 

Matches 1148; Conservative 0; Mismatches 145; Indels 50; Gaps 3; 
Qy 10 C CGAG ATGG CG C CTTCGGGG CTG CGG CTCTTG CCG CTG CTG CTG C CG CTG CTGTGG CTG C 6 9 

II III 1 1 1 1 II Mil;!; M 1 1 1 1 1 1 II I ill l 1 1 1 ] I ! f l : 1 1 ! 1 1 I 

Db 83 7 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 8 96 

Qy 7 0 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 129 

I IMIIMIIIIIIIII MINIMI Mill 1 1 1 1 1 1 1 1 , 1 1 1 1 MINIUM 

Db 8 97 TGGTGCTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 956 

Qy 13 0 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 189 

IIIIIIIIMIMIIIIIIIMIIIIIMII Mill' 1 1 1 1 1 1 1 1 1 1 1 MM 

Db 957 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 1016 

Qy 190 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 9 

I II M 1 1 1 1 1 1 1 1 1 1 'I I II 1 1 h MNIMIMINIMIMI INN M II I 

Db 1017 TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 1076 

Qy 25 0 CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 09 

I II MINIM 1 1 1 1 1 1 1 1 1 1 1 f 1 1 IMMMI MM I M II 1 1 1 1 M 1 1 1 1 1 

Db 107 7 CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 1136 

Qy 310 CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 3 69 

I Mill 1 1 1 1 1 1 E I E I i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 f 1 1 r 1 1 1 1 1 1 I MM 

Db 113 7 CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAA CG 1196 

Qy 37 0 AAATCTATGATAAATTC^AGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 42 9 

III, MUM II Ml I I MINI IMMMI I IMMMI II I 

Db 1197 AAATCTATGACAAGTTCAAGCAGAGTACAC^CAGCATAT^ 12 56 

Qy 43 0 AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCT- - 4 87 

Ml Ml IMMMI II Mill II MINIM MMMNIIMM Mill 



Db 


1257 


Qy 


488 


Db 


1317 


Qy 


547 


Db 


1377 


Qy 


607 


Db 


1437 


Qy 


667 


Db 


1497 


Qy 


727 


Db 


1557 


Qy 


787 


Db 


1617 


Qy 


847 


Db 


1677 


Qy 


907 


Db 


1698 


Qy 


967 


Db 


1758 


Qy 


1027 


Db 


1818 


Qy 


1087 


Db 


1878 


Qy 


1147 


Db 


1938 


Qy 


1207 


Db 


1998 


Qy 


1267 


Db 


2058 



1317 GG AGG CTCAAGTTAAAAGTGGAG CAG CA CGTGG AG CTG TA CCAG AAATA CAG CAA CAATT 1376 
CCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCT 606 

MINI 1 1 . 1 1 1 1 1 1 >i 1 1 M , |! 1 1 1 Mill Mill M MMM I II I 

CCTGGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTT 143 6 
TTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTC 666 

MMMMIMMMIMMMMIMM III III III II II Mill MM 

TTGATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTC 14 96 
667 GCCTC^GTGCCCACTGTTCCTGTGAC^GC^AAGATAACACACTCCACGTGGAAATTAACG 726 

MM M MMMM MMMIMMM MIMMMM M Mill II MM 

14 97 GCCTTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACG 1556 
GGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCC 786 

MMM I I IIIIIIII 1 1 1 J 1 1 1 1 1 f 1 1 1 1 1 1 1 1 J I IMMMMMMI MM 

GGTTCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCC 1616 
TGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTGCCGGCACCGCC 84 6 

MM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 Mill MMIMMMMMI 

TGCTTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCC 1676 



II MMMM Mill 



GCTTCAGCTCCACGGAGAAGAACTGCTGCGTGGGGCAGCTCTACATTGACTTCCGGAAGG 966 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IMMMMMMI MM 

GCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGG 1757 
ACCTGGGCTGGAAGTGGATTCATGAACGCAAGGGCTACCATGCCAATTTCTGCCTGGGGC 1026 

MM IMMMMMMI II II II 1 1 1 II 1 1 1 1 II II 1 1 1 II MMMM MM 

ACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGC 1817 
CCTGTCCCTACATCTGGAGCCTAGACAGTCAGTACAGCAAGGTCCTGGCTCTGTACAACC 1086 

MM IIIIIIII MMMM Mill I II 1 1 1 1 1 II II II 1 1 II 1 1 MMMMM 

CCTGCCCCTACATTTGGAGCGTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACC 18 77 
AGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGC 114 6 

MM MIMMMM I II 1 1 1 II I II II II II I M II 1 1 1 II II 1 1 1 1 1 1 1 II MM 

AGCATAACCCGGGCGCCTCGGCGGCGCGGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGC 193 7 
CCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGC 12 06 

1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ( I i I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M M 

CCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGC 1997 
GTTGCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCC 1266 

I I M 1 1 1 1 1 1 1 f I M II 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 II II I II I I I Mill 

GCTCCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCC 2 057 
GGCCCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACATCGTGCCCC 1318 

MM MM III M II IMMMMMMMMM MINIM 

CACCCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACAGCGTGCCCC 2117 



Qy 



1319 AAGCCCACTTGGGATCGATTAAA 1341 



Db 




RESULT 5 
US-09-948-002-1 

; Sequence 1, Application US/09948002 

; Publication No. US2 003 0050265A1 

; GENERAL INFORMATION: 

; APPLICANT: Nicholas M . Dean 

; APPLICANT: Susan F. Murray 

; TITLE OF INVENTION: ANTI SENSE MODULATION OF TRANSFORMING GROWTH 
; TITLE OF INVENTION: FACTOR BETA EXPRESSION 
; FILE REFERENCE: ISPH-0607 

; CURRENT APPLICATION NUMBER: US/09/948 , 002 
; CURRENT FILING DATE: 2000-09-05 

PRIOR APPLICATION NUMBER: 09/661,753 

PRIOR FILING DATE: 2000-09-14 
; PRIOR APPLICATION NUMBER: 60/154,546 
; PRIOR FILING DATE: 1999-09-17 
; NUMBER OF SEQ ID NOS : 71 
; SEQ ID NO 1 

LENGTH: 2 094 
TYPE: DNA 

ORGANISM: Mus musculus 
FEATURE : 
NAME /KEY: CDS 
LOCATION: (868) . . . (2040) 
US-09-948-002-1 

Query Match 63.7%; Score 862.2/ DB 11; Length 2094 ; 

Best Local Similarity 81.7%; Pred. No. 6.6e-222; 

Matches 1040; Conservative 0; Mismatches 188; Indels 45; Gaps 2; 
Qy 10 C CGAGATGG CGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTG CTGTGG CTG C 69 



Db 




Qy 



70 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 9 



Db 




Qy 



13 0 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 9 



Db 




Qy 



190 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 9 



Db 




Qy 



250 CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 09 



Db 




Qy 



310 CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 369 



Db 


1163 


Qy 


370 


Db 


1223 


Qy 


430 


Db 


1283 


Qy 


490 


Db 


1343 


Qy 


550 


Db 


1403 


Qy 


610 


Db 


1463 


Qy 


670 


Db 


1523 


Qy 


730 


Db 


1583 


Qy 


790 


Db 


1643 


Qy 


850 


Db 


1702 


Qy 


910 


Db 


1724 


Qy 


970 


Db 


1784 


Qy 


1030 


Db 


1844 


Qy 


1090 


Db 


1904 


Qy 


1150 



I II lllllllllll II II 1 1 1 1 1 1 1 1 M 1 1 ! 1 1 Ml 1 1 1 1 1 1 1 II 1 1 1 1 

CCGAAGCGGACTACTATGCTAAAGAGGTCACCCGCGTGCTAATGGTGGACCGCAACAACG 1222 

AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 42 9 

IIIIMII III III I II I I Mill IIIIMII I Mill Mill I 

CC^TCTATGAGAAAACC^U^GAC^TCTCAC^CAGTATATATATGTTCTTCAATACGTC^G 1282 

AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 89 

I I IMMMI Mill Mill llilll II II MMMMMM III II 

ACATTCGGGAAGCAGTGCCCGAACCCCCATTGCTGTCCCGTGCAGAGCTGCGCTTGCAGA 1342 

GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 9 

I I II I II IMMMI M Mill II lllllllllll Mill MMM 

GATTAAAATC^GTGTGGAGCAAC^TGTGGAACTCTACCAGAAATATAGCAACAATTCCT 14 02 

GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 609 

MM Mill I MMMMMM MMM III I II lllllllllll MM 

GGCGTTACCTTGGTAACCGGCTGCTGACCCCCACTGATACGCCTGAGTGGCTGTCTTTTG 14 62 

ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 66 9 

I Mill IIIIMII 1 1 1 1 1 1 1 1 i I ! J I II MM I III MM Mill 

ACGTCACTGGAGTTGTACGGCAGTGGCTGAACCAAGGAGACGGAATACAGGGCTTTCGAT 1522 

TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 729 

MM II Mill II Ml lllllllllllll II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 MMM 

TCAGCGCTCACTGCTCTTGTGACAGCAAAGATAACAAACTCCACGTGGAAATCAACGGGA 1582 

TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 78 9 

Ml I M Mill 1 1 [ 1 1 1 1 MMM II I MM II MMM I 

TCAGCCCCAAACGTCGGGGCGACCTGGGCACCATCCATGACATGAACCGGCCCTTCCTGC 1642 

TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 9 

M I II 1 1 1 1 ! I II 1 1 M I II I M M M I M I Ml M 1 1 IMMMI II 

TCCTCATGGCCACCCCCCTGGAAAGGGCCCAGCACCTGCACAGCTCACGGCACCGGAGA- 1701 

CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 909 

Illllllllllllllll MM 

GCCCTGGATACCAACTATTGCT 1723 

TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 969 

Minimi iiiiiiiiiiiiii miiiiiiii iiiiiiiiiii iiiiiiiM 

TCAGCTCCACAGAGAAGAACTGCTGTGTGCGGCAGCTGTACATTGACTTTAGGAAGGACC 1783 



MM lllllllllll II II 1 1 M MM II 1 1 1 II MM II I II MM MM 

TGGGTTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGTCTGGGACCCT 
GTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 

I MM M IIIIMII 1 1 II I MMIMIMMIMM || || MMM I 

GCCCCTATATTTGGAGCCTGGACACACAGTACAGCAAGGTCCTTGCCCTCTACAACCAAC 
ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 

MMMMMM MIM I 1 1 1 1 1 1 1 II 1 1 II II II 1 1 1 1 1 1 1 1 II I II 1 1 1 1 1 

ACAACCCGGGCGCTTCGGCGTCACCGTGCTGCGTGCCGCAGGCTTTGGAGCCACTGCCCA 
TCGTGTACTACGTGGGCCGC7U\GCCCAAGGTGGAG(^GCTGTCCAACATGATCGTGCGTT 

MM MMM 1 ' 1 1 i 1 1 J 1 1 1 1 1 i 1 1 1 1 1 1 1 1 lllllllllllll MM I 



Db 1964 TCGTCTACTACGTGGGTCGC^AGCCCAAGGTGGAGCAGTTGTCCAACATGATTGTGCGCT 2 023 



Qy 1210 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 12 6 9 

1 1 M 1 1 1 1 1 1 1 1 1 1 M ! IIIIIMIIMI Ml Ml IMMMMMMM 

Db 2 024 CCTGCAAGTGCAGCTGAAGCCCCGCCCCGC CCCGCCCCTCCCGGCAGGCCCGGC 2077 

Qy 1270 CCCACCCCCGCCC 1282 

Ml MMMMI 

Db 2078 CCCGCCCCCGCCC 2090 



RESULT 6 

US-09-948-002-27 

Sequence 27, Application US/09948002 
Publication No. US20030050265A1 
GENERAL INFORMATION: 
APPLICANT: Nicholas M. Dean 
APPLICANT: Susan F. Murray 

TITLE OF INVENTION: ANTI SENSE MODULATION OF TRANSFORMING GROWTH 
TITLE OF INVENTION: FACTOR BETA EXPRESSION 
FILE REFERENCE: ISPH-0607 

CURRENT APPLICATION NUMBER: US/09/948 , 002 
CURRENT FILING DATE : 2000-09-05 
PRIOR APPLICATION NUMBER: 09/661,753 
PRIOR FILING DATE: 2000-09-14 
PRIOR APPLICATION NUMBER: 60/154,546 
PRIOR FILING DATE: 1999-09-17 
NUMBER OF SEQ ID NOS : 71 
SEQ ID NO 27 
LENGTH: 1585 
TYPE: DNA 

ORGANISM: Rattus norvegicus 
FEATURE : 
NAME /KEY: CDS 
LOCATION: (413) . . . (1585) 
US-09-948-002-27 

Query Match 62.3%; Score 842.6; DB 11; Length 1585; 

Best Local Similarity 82.1%; Pred. No. 1.2e-216; 

Matches 999; Conservative 0; Mismatches 179; Indels 39; Gaps 1; 
Qy 10 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 69 

M Ml MM MMMMIMMI MMMI MMI M M I MMM I 

Db 4 08 CCCCCATGCCGCCCTCGGGGCTGCGGCTGCTGCCGCTTCTGCTCCCACTCCCGTGGCTTC 4 67 

Qy 7 0 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 9 

MMMMMMI M MM MMI Mill 1 1 1 M I II I M I M M I II II II M 

Db 4 68 TAGTGCTGACGCCCGGGAGGCCAGCCGCGGGACTCTCCACCTGCAAGACCATCGACATGG 527 

Qy 13 0 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 9 

1 1 1 M I II 1 1 MMMMIMMI MMI II MMMI 1 1 1 1 f 1 1 1 II Ml 

Db 528 AGCTGGTGAAACGGAAGCGCATCGAAGCCATCCGTGGCCAGATCCTGTCCAAACTAAGGC 587 

Qy 190 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 9 

I Mill MMMMMMIMM II Mill I I ' I I I I , I I I Mill II II I 
Db 58 8 TCGCCAGTCCCCCGAGCCAGGGGGAGGTACCGCCGGGCCCGCTGCCCGAGGCGGTGCTCG 64 7 



Qy 


250 


CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 

1 1 1 llllllll MIMIIIIIIMI II II II II 1 II lllllllllllll 

CTTTGTACAACAGCACCCGCGACCGGGTGGCAGGCGAGAGCGCTGACCCGGAGCCCGAGC 


309 


Db 


648 


707 


Qy 


310 


CAGAGGCGGACTACTAGGCCAAGGAGGTCACCCGCGTGGTAATGGTGGAAAGCGGCAACC 

1 IIIIIIIMIIIIIIIIIII II 11:1 1 III IIMIM IIIIIM II MM 

CCGAGGCGGACTACTACGCCAAAGAAGTCACCCGCGTGCTAATGGTGGACCGCAACAACG 


369 


Db 


708 


767 


Qy 


370 


AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 

1 1 II 1 1 1 1 1 III III 1 II 1 1 Mill llllllll 1 Mill Mill 1 

CAATCTATGACAAAACCAAAGACATCACACACAGTATATATATGTTCTTCAATACGTCAG 


429 


Db 


768 


827 


Qy 


430 


AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 

1 1 M 1 II M 1 Mill Mill llllll II II llllllllllllllll II 

ACATTCGGGAAGCAGTGCCAGAACCCCCATTGCTGTCCCGTGCAGAGCTGCGCCTGCAGA 


489 


Db 


828 


887 


Qy 


490 


GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 

1 MUM II llllllll Mill II II lllllllllll Mill llllll 

GATTCAAGTCAACTGTGGAGCAACACGTAGAACTCTACCAGAAATATAGCAACAATTCCT 


549 


Db 


888 


947 


Qy 


550 


GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 

MM Mill 1 MMMMIMI MUM III 1 II lllllllllll MM 

GGGGTTACCTTGGTAACCGGCTGCTGACCCCCACTGATACGCCTGAGTGGCTGTCTTTTG 


609 


Db 


948 


1007 


Qy 


610 


ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 

1 Mill llllllll MIMMIIMM II MM 1 III MM llllll 

ACGTCAGTGGAGTTGTCCGGCAGTGGCTGAACCAAGGAGACGGAATACAGGGCTTTCGCT 


669 


Db 


1008 


1067 


Qy 


670 


TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 

MMMi mil M I I : ■ nil, ii mi 

TCAGTGCTCACTGCTCTTGTGACAGCAAAGATAATGTACTCCACGTGGAAATCAATGGGA 


729 


Db 


1068 


1127 


Qy 


730 


TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 

Ml II II II MIMMIM llllll II 1 MIMIIII IMIIMIM 

TCAGTCCCAAACGTCGAGGTGACCTGGGCACCATCCATGACATGAACCGACCCTTCCTGC 


789 


Db 


1128 


1187 


Qy 


790 


TCCTCATGGCCAGCCCGCTGGAGAGGGCCCAGCACCTGCACAGGTCCCGGCACCGCCGAG 

1 M 1 1 II 1 M 1 1 1 1 1 1 Mill Mill M 1 IIMIM II 

TCCTCATGGCCACCCCCCTGGAAAGGGCTCAACACCTGCACAGCTCCAGGCACCGGAGA- 


849 


Db 


1188 


1246 


Qy 


850 


CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 

IMIIIIIIIIIIIIMIIMI 


909 


Db 


1247 


1268 


Qy 


910 


TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCGGGAAGGACC 

MIMMIM MIMMIIMM II IMIIMI MIMMIM 1 1 1 1 1 1 1 1 1 1 

TCAGCTCCACAGAGAAGAACTGCTGTGTACGGCAGCTGTACATTGACTTTAGGAAGGACC 


969 


Db 


1269 


1328 


Ov 


970 


MM II II III II II IIMIIIIIIIIIIIMIII Mill IMIIIMI 

TGGGTTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGGCAACTTCTGTCTGGGGCCCT 




Db 


1329 


1388 


Qy 


1030 


GTGCCTAGATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 

1 IMIIMI IMIIMI Mill MMMIMMMIMI II II llllllll 1 

GCCCCTACATTTGGAGCCTGGACACACAGTACAGCAAGGTCCTTGCCCTCTACAACCAAC 


1089 


Db 


1389 


1448 


Qy 


1090 


ACAA CCCGGG CG CGTCGG GGG CG C CGTG CTG CGTG CCG CAGG CG CTGGAG C CACTGCC CA 


1149 



Illlllllll II II II I llllllllllllllllllll lllllllllllllll 

Db 144 9 AC^CCCGGGTGCTTCCGCATCACCGTGCTGCGTGCCGCAGGCTTTGGAGCCACTGCCCA 15 08 

Qy 1150 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 12 09 

1 1 1 1 lllllllllll IIIIIIIIIMIIIIIIIIII Mill I IIILIIIMl I 

Db 1509 TCGTCTACTACGTGGGTCGCAAGCCCAAGGTGGAGCAGTTGTCCAACATGATCGTGCGCT 15 68 

Qy 1210 C CTG CAAGTG CAG CTGA 1226 

Illllllllllllllll 
Db 1569 CCTGCAAGTGCAGCTGA 1585 



RESULT 7 

US-09-756-283A-19 

Sequence 19, Application US/09756283A 
Patent No. US20020151478A1 
GENERAL INFORMATION : 
APPLICANT: Chernajovsky, Yuti 
APPLICANT : Dreja, Hanna Stma 
APPLICANT: Adams, Gillian 
TITLE OF INVENTION: Latent Fusion Protein 
FILE REFERENCE: 0623.1000000 
CURRENT APPLICATION NUMBER : US/09/756 , 283A 
CURRENT FILING DATE: 2001-01-09 
NUMBER OF SEQ ID NOS : 100 
SOFTWARE: Patent In version 3.0 
SEQ ID NO 19 
LENGTH: 13 76 
TYPE: DNA 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: LAP-mlFNbeta construct 
NAME/ KEY: CDS 
LOCATION: (1) . . (1368) 
US-09-756-283A-19 

Query Match 48.6%; Score 657.2; DB 10; Length 1376; 

Best Local Similarity 88.3%; Pred . No. 8.3e-167; 

Matches 726; Conservative 0; Mismatches 93; Indels 3; Gaps 1; 

Qy 15 ATGG CG C CTTCGGGG CTG CGG CTCTTG C CG CTG CTG CTG CCG CTG CTGTGG CTG CTAGTG 74 

III I I I I II lllllllllll IMIIMMIIII IIIIIIIIIIIIII II III 
Db 1 ATGC CG C C CTC CGGG CTG CGG CTG CTG C CG CTG CTG CTA C CG CTG CTGTGG CTACTGGTG 60 

Qy 75 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 134 

1 1 1 : I ' 1 1 ' , 1 1 1 IIIIIIIII Mill 1 1 M 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 61 CTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 12 0 

Qy 135 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 194 

1 1 1 1 1 f 1 1 M 1 1 1 i M M 1 1 1 1 1 1 . , 1 1 1 1 1 III IMMIIMM Mill III 

Db 121 GTGAAG CGG AAG CG CAT CGAGG C CAT C CG CGG C CAG AT C CTGT CCAAGCTG CGG CT CG CC 18 0 

Qy 195 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 2 54 

IIIMIIIIIIIIIMIMI : I II 1 1 1 II 1 1 1 , 1 1 1 1 1 1, Mill II M II M 

Db 181 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 24 0 



Qy 



2 55 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 314 



Db 


241 


Qy 


315 


Db 


301 


Qy 


375 


Db 


361 


Qy 


435 


Db 


421 


Qy 


492 


Db 


481 


Qy 


552 


Db 


541 


Qy 


612 


Db 


601 


Qy 


672 


Db 


661 


Qy 


732 


Db 


721 


Qy 


792 


Db 


781 



MINIM MMMIIIMIM IIIIMM MM 1 1 1 1 II M I II 1 1 1 II I III 

TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 3 00 
GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 374 

M 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 I MM Mill 

GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 360 
TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 4 34 

Mill M MUM I I MIMI IIIIMM I IIIIMM II MINI 

TATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAGAGCTC 42 0 
CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCT GAGG 4 91 

II IIIIMM M Mill II IIIIMM IMMIMIIMM Mill MM 

CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGAGG 4 8 0 
CTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGG 551 

1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 M 1 1 1 II IIIIMM 

CTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGG 54 0 
CGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGAT 611 

II 1 1 II 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 MIM Mill II MIMI I II MIMI 

CGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGAT 6 00 

GTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTC 671 

I I I II I II I II I II I I I I I I I M I III III Ml II M Mill I'M 
GTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTT 660 

AGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTC 731 

II IIIIMM MIMMMMM MIMIIMM II Mill II 1 1 1 1 1 1 1 1 

AGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTC 72 0 
AATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTC 7 91 

I I I IIIIMM 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 MIMMMIMM llllllll 

ACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCGTTTCCTGCTT 780 
CTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGC 833 

I M I M MM M Ml I M , 1 1 1 1 1 I ' I M I Mill III 

CTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGC 822 



RESULT 8 

US-09-756-283A-21 

; Sequence 21, Application US/09756283A 

; Patent No. US20020151478A1 

; GENERAL INFORMATION: 

; APPLICANT: Cherna;} ovsky , Yuti 

; APPLICANT: Dre j a , Hanna Stina 

; APPLICANT: Adams, Gillian 

TITLE OF INVENTION: Latent Fusion Protein 
; FILE REFERENCE: 0623.1000000 

; CURRENT APPLICATION NUMBER: US/0 9/756 , 283A 

; CURRENT FILING DATE: 2001-01-09 

; NUMBER OF SEQ ID NOS : 100 

; SOFTWARE: Patentln version 3.0 

; SEQ ID NO 21 

LENGTH: 13 52 

TYPE: DNA 



ORGANISM : Artificial Sequence 
FEATURE : 

OTHER INFORMATION: ml FNbeta -LAP construct 
NAME /KEY: CDS 
LOCATION: (1) . . (1344) 
US-09-756-283A-21 

Query Match 43.9%; Score 594.6; DB 10; Length 1352; 

Best Local Similarity 87.3%; Pred . No. 5.7e-150; 

Matches 664; Conservative 0; Mismatches 94; Indels 3; Gaps 1; 

GACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTGGT 13 6 

I II llllll Mil III llllllllllllll lllllllllllllllll 

GGCGGGAGGGGGCTCAGCGGCCGCACTATCCACCTGCAAGACTATCGACATGGAGCTGGT 641 
GAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAG 196 

'IIMII' Ml MM 1 1 ■ 1 1 1 1 1 1 Mill MM 

GAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCCAG 7 01 
CCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTA 2 56 

1 1 1 1 1 ! 1 1 1 II 1 1 1 1 1 1 1 Mill MMI II M II! Ill Mill II M II II II 

CCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTGTA 761 
CAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAGGC 316 

llllll 1 1 1 1 1 1 1 1 1 1 1 i 1 1 llllllll MM lllllllllllllllll Mill 

CAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAGGC 821 
GGACTACTACGCCAAGGAGGTC^CCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATCTA 376 

MMI MMIMMM IMMMMMIMIMMMMM I MM 1 1 1 1 1 1 1 

CGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATCTA 881 
TGATAAATTCAAGGGCACCCCCC^CAGCTTATATATGCTGTTCTVACACGTCGGAGCTCCG 436 

Ml II IIMII I I IIMII llllllll I llllllll II llllllll 

TGACAAGTTCAAGCAGAGTACACACAGCAT^ 94 1 

GGAAG CGGTGC CGGAAC CTGTATTG CTCT CT CGGG CAG AG CTG CG C CTG CT GAGGCT 4 93 

llllllll II Mill II llllllll IIMIIIIIIIIII Mill IIMII 

AGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGAGGCT 1001 
CAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCG 553 

I Ml M 1 1 M I M 1 1 1 1 1 M M 1 1 M 1 1 1 II M 1 1 M I M I Ml M IIMIIMM 

CAAGTTAAAAGTGGAGCAGC^CGTGGAGCTGTACC^GAAATAC^GCAAC^ATTCCTGGCG 1061 
CTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGT 613 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 Mill Mill M llllll I II Mllllll 

ATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGT 1121 
CACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAG 673 

I IMM I M I M I II Ml 1 1 III Ml III II II Mill Mllllll II 

CACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAG 1181 
TGCCC^CTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAA 73 3 

Mllllll 1 1 1 1 1 1 1 1 1 1 1 1 1 IMMMIMI II Mill II 1 1 1 1 1 M 1 1 1 

CGCCC^CTGCTCCTGTGACAGC^GGGATAAC^C^CTGC^GTGGA(^TCAACGGGTTCAC 1241 
TTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCT 793 

I I Mllllll M 1 1 1 1 M l MM 1 1 1 M 1 1 I II 1 1 1 1 1 M 1 1 1 1 IIMII II 



Qy 


77 


Db 


582 


Qy 


137 


Db 


642 


Qy 


197 


Db 


702 


Qy 


257 


Db 


762 


Qy 


317 


Db 


822 


Qy 


377 


Db 


882 


Qy 


437 


Db 


942 


Qy 


494 


Db 


1002 


Qy 


554 


Db 


1062 


Qy 


614 


Db 


1122 


Qy 


674 


Db 


1182 


Qy 


734 



Db 1242 TACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCT 1301 



Qy 7 94 CATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCT 834 

llllllll MM II IIIIMMMI Mill 1 1 1 1 

Db 13 02 CATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCT 134 2 



RESULT 9 

US-09-911-904-167 

Sequence 167, Application US/09911904 
Publication No. US20030096234A1 
GENERAL INFORMATION : 
APPLICANT: Farr, Spencer B. 
APPLICANT: Pickett, Gavin G. 
APPLICANT: Neft, Robin Eileen 
APPLICANT: Dunn, II, Robert Thomas 
TITLE OF INVENTION: CANINE TOXICITY GENES 
FILE REFERENCE: 400742000200 
CURRENT APPLICATION NUMBER : US/09/ 91 1 , 904 
CURRENT FILING DATE: 2002-04-09 
PRIOR APPLICATION NUMBER: US 60/220,057 
PRIOR FILING DATE: 2000-07-21 
NUMBER OF SEQ ID NOS : 386 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 167 
LENGTH : 4 8 9 
TYPE: DNA 

ORGANISM: Canis familiaris 
FEATURE : 

NAME/ KEY: misc_f eature 
LOCATION: (1) . . . (489) 

OTHER INFORMATION: n = A, T, C or G 
US-09-911-904-167 

Query Match 27.1%; Score 366.4; DB 11; Length 489; 

Best Local Similarity 86.1%; Pred. No. l.le-88; 

Matches 445; Conservative 0; Mismatches 32; Indels 40; Gaps 2; 
Qy 778 GGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCC 8 37 

I llllllllllilllllllllllllll 1 1 M 1 1 1 1 ill 1 1 h 1 1 1 ! 1 1 1 1 1 II I II 

Db 1 GACCCTTCCTGCTCCTCATGGCCACCCCACTGGAGAGGGCCCAGCACCTGCACAGCTCCC 6 0 

Qy 838 GGCACCGCCGAGCCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATA 8 97 

1 1 1 1 HIM MINIMI I 

Db 61 GGCAGCGCCG GGCCCTGGACA 81 

Qy 8 98 CCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACT 957 

I M 1 1 1 1 1 1 1 II II 1 1 II 1 1 1 1 1 1 M 1 1 II 1 1 M 1 1 I; [ 1 1 E 1 1 1 1 1 F 1 1 1 1 1 1 f I f 

Db 82 CCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTCCGGCAGCTCTACATTGACT 141 

Qy 958 TCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCT 1017 

MM Mill MM III MM Mill IIMIIII Mill II II 1 1 1 1 

Db 142 TCCGCAAGGATCTGGGCTGGAAGTGGATCCATGAGCCCAAGGGTTACCACGCTAACTTCT 201 

Qy 1018 GCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTC 1077 

MINIMUM! IIMIIII Mill 1 1 II I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 I 

Db 2 02 GCCTGGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCC 261 



Qy 1078 TGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 1137 

III II I MIMI M M MIMM Ml M I MM IM M I III I ' Ml MM MIMI 1 1 ! 

Db 262 TGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 321 

Qy 113 8 AGCCACTGCCCATCGTGTACTACGTGGGCCGC^AGCCCAAGGTGGAGCAGCTGTCCAACA 1197 

M M 1 1 1 M M M I M I M I M I M M 1 1 M I M 1 1 M M II M M M M 1 1 1 1 1 1 1 II 

Db 322 AGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCGAACA 381 

Qy 1198 TGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCG-CCCACAGCCCCGCCCACC 1256 

1 1 1 1 ! 1 . 1 1 1 1 1 M M I II MMI I MM II M I MM M I, 1 1 Ml III II I 

Db 382 TGATCGTGCGCTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGTCCGGCAGGCCCCGCCCAC 441 

Qy 1257 CGGCAGGCCCGGCCCCACCCCCGCCCGCCTCACCGGG 12 93 

IIIIIM llllllll lllllllllll I Mill 
Db 442 CGGCAGGNCCGGCCCCGCCCCCGCCCGCTGCGCCGGG 478 



RESULT 10 
US-09-813-271B-1 

; Sequence 1, Application US/09813271B 
; Patent No. US20020115834A1 
GENERAL INFORMATION: 
APPLICANT: 

(A) Nico Cerletti 
TITLE OF INVENTION: New process for the production of 

biologically active protein 
NUMBER OF SEQUENCES : 13 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: No. US2 0 02 0 1 15834Alart i s Patent Department 

STREET: 564 Morris Avenue 

CITY: Summit 

STATE: New Jersey 

COUNTRY: USA 

ZIP: 07901 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC- DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/8 13 , 271B 

FILING DATE: 20-Mar-2001 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/EP95/0271 9 

FILING DATE: 12-Jul-95 

APPLICATION NUMBER: EPO 94810439.3 

FILING DATE: 25-Jul-94 
ATTORNEY/AGENT INFORMATION: 

NAME: Pfeiffer, Hesna J . . 

REGISTRATION NUMBER: 2264 0 

REFERENCE/ DOCKET NUMBER : 4 -20039C/C1C1/USN 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (908) 522-6940 

TELEFAX: (908) 522-6955 
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 



LENGTH: 33 9 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : doubl e 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA to mRNA 
HYPOTHETICAL: NO 
IMMEDIATE SOURCE: 

CLONE: E. coli LC137/pPLMu . hTGF-betal (DSM 5656) 
FEATURE : 

NAME /KEY : CDS 
LOCATION: 1 . .336 

OTHER INFORMATION: /product = "human TGF-betal" 
SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
US-09-813-271B-1 

Query Match 22.8%; Score 308.6; DB 10; Length 339; 

Best Local Similarity 94.4%; Pred. No. 3.5e-73; 

Matches 320; Conservative 0; Mismatches 19; Indels 0; Gaps 0; 
Qy 8 8 8 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 94 7 

MINIM 1 1 f i 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 ; 1 1 1 1 : 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 6 0 

Qy 94 8 TACATTGA CTTC CGGAAGGAC CTGGG CTGGAAGTGGATTCATGAA C C CAAGGG CTAC CAT 1007 

iiiiiiiiiMiii iMiiiii i 1 1 1 f i f 1 1 1 1 1 1 1 1 1 m ill, i : ; 1 1 1 1 : i 

Db 61 TACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 12 0 

Qy 10 08 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 1067 

Mill IMIIIII IMIIIII IMIIIII IMIIIII Mill I LIUM I 

Db 121 GCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 180 

Qy 1068 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 1127 

IMIIIII MMMIMIMM Mil 1 .1 1 1 1 1 1 1 F 1 1 1 1 1 1 f I E 1 1 1 1 1 1 1 1 

Db 181 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 24 0 

Qy 1128 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 1187 

MMMIMIMM 1 1 1 MM 1 1 1 Ml M II I M MUM I II M II 1 1 1 II I I Ml 

Db 241 CAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 3 00 

Qy 1188 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 122 6 

M M M I M II II I I II I I I IIIIMIIMIMIII I 
Db 3 01 CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 339 



RESULT 11 
US-09-906-158-3 

; Sequence 3, Application US/09906158 

; Publication No. US20030078217A1 

; GENERAL INFORMATION: 

; APPLICANT: Brett P. Monia 

; APPLICANT: Susan M . Freier 

; TITLE OF INVENTION: ANTI SENSE MODULATION OF TRANSFORMING GROWTH FACTOR-BETA 3 
EXPRESSION 

; FILE REFERENCE: RTS-0257 

; CURRENT APPLICATION NUMBER : US/ 09/ 9 06 , 158 
; CURRENT FILING DATE: 2001-07-14 
; NUMBER OF SEQ ID NOS : 168 



SEQ ID NO 3 
LENGTH: 2 574 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE : 
NAME /KEY: CDS 
LOCATION: (254) . . . (1492) 
US-09-906-158-3 

Query Match 17.9%; Score 241.6; DB 11; Length 2574; 

Best Local Similarity 53.8%; Pred. No. 5.9e-55; 

Matches 670; Conservative 0; Mismatches 519; Indels 57; Gaps 6; 
Qy 4 0 TGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTGCTGACGCCTGGCCGGCCGGCCGCCG 99 

III III I I Ml II III MINI II I I II 

Db 261 TGCACTTGCAAAGGGCTCTGGTGGTCCTGGCCCTGCTGAACTTTGCCACGGTCAGCCTCT 32 0 

Qy 100 GACTGTCCACCTGCAAGACCATCGACATGGAGCTGGTGAAGCGGAAGCGCATCGAGGCCA 15 9 

MINIM MM III I III I I I I Ml MM I I II MM 

Db 321 CTCTGTCCACTTGCACC^CCTTGGACTTCGGCC^CATCAAGAAGAAGAGGGTGGAAGCCA 38 0 

Qy 160 TTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAGCCCCCCGAGCCAGGGGGACGTGC 219 

M I M Mill II llllll MM 1 1 1 1 1 1 1 1 1 1 I M II 

Db 381 TTAGGGGACAGATCTTGAGCAAGCTCAGGCTCACCAGCCCCCCTGAGCCAACGGTGATGA 44 0 

Qy 220 CGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTACAACAGTACCCGCGACCGGGTAG 27 9 

I I II Mil M Mill IMIIIIIMI MMI llllll 

Db 441 CCCACG TCCCCTATCAGGTCCTGGCCCTTTACAACAGCACCCGGGAGCTGCTGG 4 94 

Qy 28 0 CCGGGGAAAGTGTCGAACCGGAGCCCG AGCCAGAGGCGGACTACT 324 

II II II MM I I I Ml MM MM 

Db 4 95 AGGAGATGCATGGGGAGAGGGAGGAAGGCTGCACCCAGGAAAACACCGAGTCGGAATACT 554 

Qy 325 ACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATCTATGATAAAT 384 

I Mill MM I llllll I I I II 

Db 555 ATGCCAAAGAAATCCATAAATTCGACATGATCCAGGGGCTGGCGGAGCACAACGAACTGG 614 

Qy 385 TC^GGGC^CCCCCC^(^GCTTATATATGCTGTT(^ACACGTCGGAGCTCCGGGAAGCGG 44 4 

M I I I I I I III I III III 

Db 615 CTGTCTGCCCTAAAGGAATTACCTCCAAGGTTTTCCGCTTCAATGTGTCCTCAGTGGAGA 674 

Qy 44 5 TG C CGGAA C CTGTATTG CT CT CT CGGG CAGAG CTG CG C CTG CTG AGG CTCAAG TTAAAAG 504 

II III I I I I I III I II III I 

Db 675 AAAATAGAACCAACCTATTCCGAGCAGAATTCCGGGTCTTGCGGGTGCCCAACCCCAGCT 734 

Qy 5 05 TGGAG CAG CA CG TGGAG CTATA C CAGAAATACAG CAATG 543 

Ml llllll II I I I II II 

Db 735 CTAAGCGGAATGAGCAGAGGATCGAGCTCTTCCAGATCCTTCGGCCAGATGAGCACATTG 7 94 

Qy 544 ATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGT 603 

llllll III II Ml II I I I I I I MINIUM 

Db 795 CCAAACAGCGCTATATCGGTGGCAAGAATCTGCCCACACGGGGCACTGCCGAGTGGCTGT 854 

Qy 604 CCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTT 663 

' MUM, | NUN | | | M I II I llllll I I III 

Db 8 55 CCTTTGATGTCACTGACACTGTGCGTGAGTGGCTGTTGAGAAGAGAGTCCAACTTAGGTC 914 



Qy 664 TTCGCCTCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACA CACTCCACGTGG 717 

I I I I I llllll I III III I I II III I 

Db 915 TAGAAATCAGCATTCACTGTCCATGTCACACCTTTCAGCCCAATGGAGATATCCTGGAAA 974 

Qy 718 AAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACC 777 

I Ml III I I I I I I III II III II 

Db 975 ACATTCACGAGGTGATGGAAATCAAATTCAAAGGCGTGGACAATGAGGATGACCATGGCC 1034 

Qy 778 GGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCC 837 

I llllll llllll I II I I I I I 

Db 1035 GTGGAGATCTGGGGCGCCTCAAGAAGCAGAAGGATCACCACAACCCTCATCTAATCCTCA 10 94 

Qy 838 GG CACCGCCGAGCCCTGGATACCAACGACTACAAGGATGACGACGACAA- - -GG 8 8 8 

I I II I II II I I I I I II I I III II 

Db 1095 TGATGATTCCCCCACACCGGCTCGACAACCCGGGCCAGGGGGGTCAGAGGAAGAAGCGGG 1154 

Qy 88 9 CCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCT 94 8 

I 1 1 1 1 Mill 1 1 1 1 1 1 1 1 1 II I MM Milllllll Mill I MM 

Db 1155 CTTTGGACACCAATTACTGCTTCCGCAACTTGGAGGAGAACTGCTGTGTGCGCCCCCTCT 1214 

Qy 94 9 ACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATG 10 08 

IMMMMMM MM 1 1 1 1 M I II 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 IMMMM III 

Db 1215 ACATTGACTTCCGACAGGATCTGGGCTGGAAGTGGGTCCATGAACCTAAGGGCTACTATG 1274 

Qy 1009 CCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGG 1068 

MM llllll M M II II III II I II llllll llllll II 

Db 1275 CCAACTTCTGCTCAGGCCCTTGCCCATACCTCCGCAGTGCAGACACAACCCACAGCACGG 1334 

Qy 1069 TCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGC 1128 

I MM IMMMM I Mill I II II II MM 1 1 1 1 1 1 1 1 1 1 1 I 

Db 1335 TGCTGGGACTGTACAACACTCTGAACCCTGAAGCATCTGCCTCGCCTTGCTGCGTGCCCC 13 94 

Qy 1129 AGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGC 118 8 

Ml MMMM III Mill 1 1 1 1 1 1 1 II II I I Mill 1 1 f 1 1 1 1 1 1 1 

Db 13 95 AGGACCTGGAGCCCCTGACCATCCTGTACTATGTTGGGAGGACCCCCAAAGTGGAGCAGC 14 54 

Qy 118 9 TGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGC 1234 

I IMMMM I III II II II II MIIM I MM I 

Db 14 55 TCTCCAACATGGTGGTGAAGTCTTGTAAATGTAGCTGAGACCCCAC 1500 



RESULT 12 
US-10-028-158-20 

; Sequence 20, Application US/10028158 

; Publication No. US20020110833A1 

; GENERAL INFORMATION: 

; APPLICANT: Caniggia, Isabella 

; APPLICANT: Post, Martin 

; APPLICANT: Lye, Stephen 

; TITLE OF INVENTION: METHODS TO DIAGNOSE A REQUIRED REGULATION OF 

; TITLE OF INVENTION: TROPHOBLAST 

; FILE REFERENCE: 11757. 38USWO 

; CURRENT APPLICATION NUMBER : US/l 0/028 , 158 

; CURRENT FILING DATE: 2001-12-20 

PRIOR APPLICATION NUMBER: US/09/38 0 , 662 
; PRIOR FILING DATE: 1999-12-21 
; PRIOR APPLICATION NUMBER: PCT/CA98 /00 18 0 



PRIOR FILING DATE: 1998-03-05 
PRIOR APPLICATION NUMBER: US 60/03 9,919 
PRIOR FILING DATE: 1997-03-07 
NUMBER OF SEQ ID NOS : 24 
SOFTWARE: Patentln version 3.0 
SEQ ID NO 20 
LENGTH: 2574 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE : 
NAME/KEY: CDS 
LOCATION: (254) . . (1492) 
US-10-028-158-20 

Query Match 17.9%; Score 241.6; DB 13; Length 2574; 

Best Local Similarity 53.8%; Pred . No. 5.9e-55; 

Matches 670; Conservative 0; Mismatches 519; Indels 57; Gaps 6; 

Qy 4 0 TGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTGCTGACGCCTGGCCGGCCGGCCGCCG 99 

III III I I III II Ml MUM II I I I I 

Db 2 61 TGCACTTGCAAAGGGCTCTGGTGGTCCTGGCCCTGCTGAACTTTGCCACGGTCAGCCTCT 32 0 

Qy 100 GACTGTCCACCTGCAAGACCATCGACATGGAGCTGGTGAAGCGGAAGCGCATCGAGGCCA 15 9 

MINI I I I I Ml I III II I I Ml I I I | | | M I I I I 
Db 321 CTCTGTCCACTTGCACCACCTTGGACTTCGGCCACATCAAGAAGAAGAGGGTGGAAGCCA 380 

Qy 160 TTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAGCCCCCCGAGCCAGGGGGACGTGC 219 

II I II Mill II MUM MM 1 1 ■ 1 1 1 : 1 I I II II 

Db 3 81 TTAGGGGACAGATCTTGAGC^GCTCAGGCTCACCAGCCCCCCTGAGCO^ACGGTGATGA 44 0 

Qy 22 0 CGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTACAACAGTACCCGCGACCGGGTAG 279 

I I II I II I II Mill IIIIIMIIII Mill II I I I I 

Db 441 CCCACG TCCCCTATCAGGTCCTGGCCCTTTACAACAGCACCCGGGAGCTGCTGG 4 94 

Qy 2 80 CCGGGGAAAGTGTCGAACCGGAGCCCG AGCCAGAGGCGGACTACT 3 24 

II II II MM I I I III MM MM 

Db 4 95 AGGAGATGCATGGGGAGAGGGAGGAAGGCTGCACCCAGGAAAACACCGAGTCGGAATACT 5 54 

Qy 3 25 ACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATCTATGATAAAT 3 84 

I Mill II II I III I I I I I I II 

Db 555 ATGCCAAAGAAATCCATAAATTCGACATGATCCAGGGGCTGGCGGAGCACAACGAACTGG 614 

Qy 3 85 TCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGCGG 4 44 

III I I I I I III I I I I I I I 

Db 615 CTGTCTGCCCTAAAGGAATTACCTCCAAGGTTTTCCGCTTCAATGTGTCCTCAGTGGAGA 674 

Qy 445 TGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGTTAAAAG 5 04 

II III I I I I I MM II III I 

Db 675 AAAATAGAACCAACCTATTCCGAGCAGAATTCCGGGTCTTGCGGGTGCCCAACCCCAGCT 734 

Qy 5 05 TGGAGCAGCACGTGGAG CTATACCAGAAATACAGCAATG 543 

III I I I I II II I I I II II 

Db 735 CTAAGCGGAATGAGCAGAGGATCGAGCTCTTCCAGATCCTTCGGCCAGATGAGCACATTG 7 94 

Qy 544 ATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGT 603 

Mill III II III III I I II I I I I I I I I I I I 
Db 7 95 CCAAACAGCGCTATATCGGTGGCAAGAATCTGCCCACACGGGGCACTGCCGAGTGGCTGT 8 54 



Qy 6 04 C CTTTGATGTCA CCGGAGTTGTG CGG CAGTGG CTG A C C CG CAG AG AGG CTATAGAGGGTT 663 

Illlillllll I IIIIII IIIIMII I IIIIII I I III 

Db 8 5 5 CCTTTGATGTCACTGACACTGTGCGTGAGTGGCTGTTGAGAAGAGAGTCCAACTTAGGTC 914 

Qy 664 TTCGCCTCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACA CACTCCACGTGG 717 

I I I I I IIIIII I III III I I II III I 

Db 915 TAGAAATCAGCATTCACTGTCCATGTCACACCTTTCAGCCCAATGGAGATATCCTGGAAA 974 

Qy 718 AAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACC 777 

I III Ml I I I I I I III II III M 

Db 97 5 ACATTCACGAGGTGATGGAAATCAAATTCAAAGGCGTGGACAATGAGGATGACCATGGCC 1034 

Qy 778 GGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCC 837 

I IIIIII IIIIII lllllll I 

Db 1035 GTGGAGATCTGGGGCGCCTCAAGAAGCAGAAGGATCACCAG^ACCCTCATCTAATCCTCA 1094 

Qy 838 GG CACCGCCGAGCCCTGGATACCAACGACTACAAGGATGACGACGACAA - - - GG 8 8 8 

I I II I II II I I I I I II I I I II II 

Db 1095 TGATGATTCCCCCACACCGGCTCGACAACCCGGGCCAGGGGGGTCAGAGGAAGAAGCGGG 1154 

Qy 88 9 CCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCT 94 8 

I MM Mill lllllllll II I MM llllllllll Mill I MM 

Db 1155 CTTTGGACACCAATTACTGCTTCCGCAACTTGGAGGAGAACTGCTGTGTGCGCCCCCTCT 1214 

Qy 94 9 ACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATG 1008 

I I M 1 1 1 1 M 1 1 MM IIIIIIIIIMIIII I IIIIMII 1 1 1 M i 1 1 1 III 

Db 1215 ACATTGACTTCCGACAGGATCTGGGCTGGAAGTGGGTCCATGAACCTAAGGGCTACTATG 1274 

Qy 10 09 CCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGG 1068 

MM IIIIII II II II II III II I II IIIIII MUM II 

Db 12 75 CCAACTTCTGCTCAGGCCCTTGCCCATACCTCCGCAGTGCAGACACAACCCACAGCACGG 1334 

Qy 1069 TCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGC 1128 

I MM lllllllll I Mill I II II M MM IMMMIMI I 

Db 1335 TGCTGGGACTGTACAACACTCTGAACCCTGAAGCATCTGCCTCGCCTTGCTGCGTGCCCC 13 94 

Qy 112 9 AGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC7UVGCCCAAGGTGGAGCAGC 1188 

Ml 1 1 III Mill lllllll II II II Mill IMIIIIIII 

Db 13 95 AGGACCTGGAGCCCCTGACCATCCTGTACTATGTTGGGAGGACCCCCAAAGTGGAGCAGC 14 54 

Qy 118 9 TGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGC 1234 

I lllllllll I III II II M II lllllll MM I 

Db 14 55 TCTCCAACATGGTGGTGAAGTCTTGTAAATGTAGCTGAGACCCCAC 1500 



RESULT 13 
US-09-957-458B-9 

; Sequence 9, Application US/09957458B 
; Publication No. US20030166271A1 
; GENERAL INFORMATION: 
; APPLICANT: Chen, Una 

TITLE OF INVENTION: Method for growing stem cells 
; FILE REFERENCE: P66567US0 

; CURRENT APPLICATION NUMBER : US/09/957 , 458B 

; CURRENT FILING DATE: 2001-09-21 

; PRIOR APPLICATION NUMBER: PCT/EP00/08247 



PRIOR FILING DATE : 2000-08-24 
PRIOR APPLICATION NUMBER : EP 99116533 
PRIOR FILING DATE: 1999-08-24 
NUMBER OF SEQ ID NOS : 10 
SOFTWARE: Patent In version 3.2 
SEQ ID NO 9 
LENGTH: 4 382 
TYPE: DNA 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: Vector for transforming supporting cell with a foreign to 
express 

OTHER INFORMATION: a gene product of interest 
US-09-957-458B-9 

Query Match 17.5%; Score 236.4; DB 12; Length 4382; 

Best Local Similarity 53.0%; Pred. No. 1.7e-53; 

Matches 662; Conservative 0; Mismatches 531; Indels 57; Gaps 5; 
Qy 30 CTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTGCTGACGCCTGGCCGG 8 9 

1 1 1 1 ii i iii iii i i iii i iii Mini i i 

Db 4 55 CTGC^GCCOVTGCACTTGCAAAGGGCTCTGGTAGTCCTGGCCCTGCTGAACTTGGCCACA 514 

Qy 90 CCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTGGTGAAGCGGAAGCGC 14 9 

I I MINIM MM II I Ml I I I I III MM I 

Db 515 ATCAGCCTCTCTCTGTCCACTTGCACCACGTTGGACTTCGGCC^CATC^AGAAGAAGAGG 574 

Qy 150 ATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAGCCCCCCGAGCCAG 2 09 

I II MIMI I II Mill II MINI MM 1 1 1 1 1 1 1 1 } I I 

Db 575 GTGGAAGCCA.TTAGGGGACAGATCTTGAGCAAGCTCAGGCTCACCAGCCCCCCTGAGCCA 634 

Qy 210 GGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTACAACAGTACCCGC 269 

II II I I II Mil II Mill IMMMMM Mill 

Db 635 T CGGTGATG A CC CA CG TCCCCTATCAGGTCCTGGCACTTTACAACAGCACCCGG 688 

Qy 270 GACCGGGTAGCCG GGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 314 

M I I I I MM II III I III 

Db 68 9 GAGTTGCTGGAAGAGATGCACGGGGAGAGGGAGGAAGGCTGCACTCAGGAGACCTCGGAG 748 

Qy 315 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 374 

I I I Mill Mill III I I I III I I I II 

Db 74 9 T CTG AGTA CTATG C CAAAGAGAT C CATAAATTCG A CATG AT C CAGGG A CTGG CGG AG CA C 8 08 

Qy 375 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 434 

MM I II II I I II I Ml Ml 

Db 8 09 AATGAA CTGG CCGT CTG CCCCAAAGGAATTACCTCTAAGGTTTTTCGTTTCAATGTGTCC 868 

Qy 43 5 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 4 94 

III Ml II II I II MM I III 

Db 8 69 TCAGTGGAGAAAAATGGAACCAATCTGTTCCGGGCAGAGTTCCGGGTCTTGCGGGTGCCC 92 8 

Qy 4 95 A AGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAA 533 

I II I II MM I I II III 

Db 92 9 AACCCCAGCTCCAAGCGCACAGAGCAGAGAATTGAGCTCTTCCAGATACTTCGACCGGAT 988 

Qy 534 TACAGCAATGATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCG 593 

I M I II M I M II II MIMI II I M I 



Db 



98 9 GAGCACATAGCCAAGCAGCGC^^ 104 8 



Qy 5 94 GAGTGGCTGTCCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCT 653 

II llllllll II llllllll I llllll llllllll I llllll I 

Db 104 9 GAATGGCTGTCTTTCGATGTCACTGACACTGTGCGCGAGTGGCTGTTGAGGAGAGAGTCC 1108 

Qy 654 ATAGAGGGTTTTCGCCTCAGTGCCCACTGTTCCTGTGACA G CAAAG ATAAC 7 04 

I llll I llll llllll! I III III III I II 

Db 1109 AACTTGGGTCTGGAAATCAGCATCCACTGTCCATGTCACACCTTTCAGCCCAATGGAGAC 1168 

Qy 7 05 ACACTCCACGTGGAAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATT 764 

lllll I llllll llll I 

Db 1169 ATACTGGAAAATGTTCATGAGGTGATGGAAATCAAATTCAAAGGAGTGGACAATGAAGAT 122 8 

Qy 765 CACGGCATGAACCGGCCCTTCCTGCTCCTCATGGCCACCCGGCTGGAGAGGGCCCAGCAC 824 

II lllll llll I llll III 

Db 122 9 GACCATGGCCGTGGAGACCTGGGGCGTCTCAAGAAGCAAAAGGATCACCACAACCCACAC 128 8 

Qy 825 CTGCACAGCTCCCGGCACCGCCGAGCCCTGGATACCAACGACTACAAGG ATGAC 878 

III I I I II lllll III I III I 

Db 1289 CTGATCCTCATGATGATCCCCCCACACCGACTGGACAGCCCAGGCCAGGGCAGTCAGAGG 134 8 

Qy 879 GACGACAAGGCGCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTG 938 

I I I MINIMI lllll MINIMI II I llll III II 

Db 134 9 AAGAAGAGGGCCCTGGACACCAATTACTGCTTCCGCAACCTGGAGGAGAACTGCTGTGTA 14 08 

Qy 93 9 CGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAG 9 98 

II I II II I III IN MM M llllllll III I II lllll III 

Db 14 09 CGCCCCCTTTATATTGACTTCCGGCAGGATCTAGGCTGGAAATGGGTCCACGAAGCTAAG 14 68 

Qy 999 GGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAG 1058 

II Ml II llllll II II II II III M I III llllll 

Db 14 69 GGTTACTATGCCAACTTCTGCTCAGGCGCTTGCGCATACCTCCGCAGCGCAGACACAACC 152 8 

Qy 1059 TACAGCAAGGTCCTGGGTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGC 1118 

I llll III II I II llllll I lllll I IMM II MM III 

Db 1529 CATAGCACGGTGCTTGGACTATACAACACCCTGAACCCAGAGGCGTCTGCCTCGCCATGC 1588 

Qy 1119 TGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAG 1178 

Mill II MM llllllll III lllll INN MINI I I llllll 

Db 1589 TGCGTCCCCCAGGACCTGGAGCCCCTGACCATCTTGTACTATGTGGGCAGAACCCCCAAG 164 8 

Qy 1179 GTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGAGG 122 8 

IIIMMIMMMMIIMI I III II II 1 1 1 1 1 . 1 1 1 1 1 1 1 

Db 164 9 GTGGAGCAGCTGTCCAACATGGTGGTGAAGTCGTGTAAGTGCAGCTGAGG 1698 



RESULT 14 
US-09-906-158-10 

; Sequence 10, Application US/09906158 

; Publication No. US20030078217A1 

; GENERAL INFORMATION: 

; APPLICANT: Brett P. Monia 

APPLICANT: Susan M. Freier 
; TITLE OF INVENTION: ANTI SENSE MODULATION OF TRANSFORMING GROWTH FACTOR-BETA 3 
EXPRESSION 

; FILE REFERENCE: RTS-02 57 



; CURRENT APPLICATION NUMBER : US/09/906 , 158 
; CURRENT FILING DATE : 2001-07-14 
; NUMBER OF SEQ ID NOS : 168 
; SEQ ID NO 10 

LENGTH: 287 9 

TYPE: DNA 
; ORGANISM: Mus musculus 

FEATURE : 

NAME/ KEY: CDS 
; LOCATION: (611) . . . (1843) 
US-09-906-158-10 

Query Match 17.2%; Score 233.2; DB 11; Length 2879; 

Best Local Similarity 52.9%; Pred. No. l.le-52; 

Matches 657; Conservative 0; Mismatches 528; Indels 57; Gaps 5; 



Qy 4 0 TGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTGCTGACGCCTGGCCGGCCGGCCGCCG 99 

ill iii i i iii i iii mill ii ii 

Db 612 TGCACTTGCAAAGGGCTCTGGTAGTCCTGGCCCTGCTGAACTTGGCCACAATCAGCCTCT 671 

Qy 100 GACTGTCCACCTGC^AGACCATCGACATGGAGCTGGTGAAGCGGAAGCGCATCGAGGCCA 15 9 

MINIM MM M MM I I I I Ml MM I I II MM 

Db 672 CTCTGTCC^CTTGCACCACGTTGGACTTCGGCCACATCAAGAAGAAGAGGGTGGAAGCCA 731 

Qy 160 TTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAGCCCCCCGAGCCAGGGGGACGTGC 219 

II I II Mill II MIMI MM MINIMI I M II 

Db 732 TT AGGGG A CAGAT CTTGAG CAAG CT CAGG CT CA C CAG C C C C CCTGAG C CAT CGGTGATG A 791 

Qy 22 0 CGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTACAACAGTACCCGCGACCGGGTAG 279 

I I II I II I II Mill lllllllllll Mill II III 

Db 792 CCCACG TCCCCTATCAGGTCCTGGCACTTTACAACAGCACCCGGGAGTTGCTGG 84 5 

Qy 28 0 CCG GGGAAAGTGT CGAAC CGG AG C C CG AG C CAG AGG CGGACTA CT 324 

I MM II III I Ml I II MM 

Db 84 6 AAGAGATGCACGGGGAGAGGGAGGAAGGCTGCACTCAGGAGACCTCGGAGTCTGAGTACT 905 



Qy 32 5 ACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATCTATGATAAAT 384 

I Mill III M lllllll II MM 

Db 906 ATGCCAAAGAGATCCATAAATTCGACATGATCCAGGGACTGGCGGAGCACAATGAACTGG 965 

Qy 3 85 TC^GGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGCGG 44 4 

I MM I lllllll II I I I I 

Db 966 CCGTCTGCCCCAAAGGAATTACCTCTAAGGTTTTTCGTTTCAATGTGTCCTCAGTGGAGA 1025 

Qy 44 5 TGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTCA 495 

III II II I II I III I II II 

Db 102 6 AAAATGGAACCAATCTGTTCCGGGCAGAGTTCCGGGTCTTGCGGGTGCCCAACCCCAGCT 108 5 

Qy 4 96 AGTTAAAAGTGGAG CAG CACGTGGAG CTATAC CAGAAATA CAGCAATG 54 3 

M III MM I I II lllllll 

Db 108 6 C CAAG CG CA CAGAG CAGAGAATTGAG CT CTTCCAGATA CTTCGA C CGGATGAG CACATAG 114 5 

Qy 54 4 ATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGT 603 

IIIIMI II II Ml II I II I I I I II lllllll 

Db 114 6 CCAAGCAGCGCTACATAGGTGGCAAGAATCTGCCCACAAGGGGCACCGCTGAATGGCTGT 12 05 



QY 



604 CCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTT 663 



I II MINIM I M I M I MINIM I 'MM I I INI 

Db 12 06 CTTTCGATGTCACTGACACTGTGCGCGAGTGGCTGTTGAGGAGAGAGTCCAACTTGGGTC 12 65 
Qy 664 TTCGCCTCAGTGCCCACTGTTCCTGTGACA GCAAAGATAACACACTCCACG 714 

I MM lllllll I Ml III III I III III I 

Db 12 66 TGGAAATCAGCATCCACTGTCCATGTCACACCTTTCAGCCCAATGGAGACATACTGGAAA 132 5 

Qy 715 TGGAAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGA 774 

I I I I I I I III I I II 

Db 13 2 6 ATGTTCATGAGGTGATGGAAATCAAATTCAAAGGAGTGGACAATGAAGATGACCATGGCC 1385 

Qy 775 ACCGGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCT 8 34 

I I I II INI I II II MINI I I 

Db 1386 GTGGAGACCTGGGGCGTCTCAAGAAGCAAAAGGATCACCACAACCCACACCTGATCCTCA 1445 

Qy 8 3 5 CCCGGCACCGCCGAGCCCTGGATACCAACGACTACAAGG ATGACGACGACAAGG 88 8 

I II II I II III I III I I I I II 

Db 14 4 6 TGATGATCCCCCCACACCGACTGGACAGCCCAGGCCAGGGCAGTCAGAGGAAGAAGAGGG 15 05 

Qy 88 9 CCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCT 94 8 

lllllll INN 1 1 1 1 1 1 1 1 II I INI 1 1 1 1 1 1 1 1 E I II III III 

Db 1506 CCCTGGACACCAATTACTGGTTCCGCAACCTGGAGGAGAACTGCTGTGTACGCCCCCTTT 1565 

Qy 94 9 ACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATG 1008 

I INIIIIIIIII INI II 1 1 1 1 1 1 1 1 Ml I N 1 1 1 1 1 Mill III III 

Db 1566 ATATTGACTTCCGGCAGGATCTAGGCTGGAAATGGGTCCACGAACCTAAGGGTTACTATG 162 5 

Qy 1009 CCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGG 1068 

INI Mill II II II II IN II I III MINI I Mil M 

Db 1626 CCAACTTCTGCTCAGGCCCTTGCCCATACCTCCGCAGCGCAGACACAACCCATAGCACGG 168 5 

Qy 1069 TCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGC 1128 

I II I II II I INN I INN II INI Mill I II I 

Db 168 6 TGCTTGGACTATACAACACCCTGAACCCAGAGGCGTCTGCCTCGCCATGCTGCGTCCCCC 174 5 

Qy 112 9 AGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGC 1188 

IN Mil IN INN lllllll WWW I I II I II M 1 1 1 1 1 1 II I 

Db 174 6 AGGACCTGGAGCCCCTGACCATCTTGTACTATGTGGGCAGAACGCCCAAGGTGGAGCAGC 18 05 

Qy 118 9 TGTCCAACATGATCGTGCGTTGCTGCAAGTGCAGCTGAGGCC 123 0 

I I I I I I I I I I I I III II II I I I I I I I Ml Ml II 
Db 18 06 TGTCCAACATGGTGGTGAAGTCGTGTAAGTGCAGCTGAGGCC 184 7 



RESULT 15 
US-09-813-271B-7 

; Sequence 7 , Application US/09813271B 
; Patent No. US20020115834A1 
GENERAL INFORMATION: 
APPLICANT: 
; (A) Nico Cerletti 

TITLE OF INVENTION: New process for the production of 

biologically active protein 
NUMBER OF SEQUENCES: 13 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: No. US20020115834Alart is Patent Department 
STREET: 564 Morris Avenue 



CITY: Summit 
STATE : New Jersey 
COUNTRY: USA 
ZIP: 07901 
COMPUTER READABLE FORM : 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE : Patentln Release #1.0, Version #1.30 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/813 , 271B 

FILING DATE: 20-Mar-2001 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/EP95/ 027 1 9 

FILING DATE: 12 -Jul - 95 

APPLICATION NUMBER: EPO 94810439.3 

FILING DATE: 25-Jul-94 
ATTORNEY/ AGENT INFORMATION: 

NAME: Pfeiffer, Hesna J. . 

REGISTRATION NUMBER: 2264 0 

REFERENCE/DOCKET NUMBER: 4 -2 0 03 9C/C1C1 /USN 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (908) 522-6940 

TELEFAX: (908) 522-6955 
INFORMATION FOR SEQ ID NO: 7: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 336 base pairs 

TYPE: nucleic acid 

STRANDEDNESS : double 

TOPOLOGY: linear 
MOLECULE TYPE: other nucleic acid 

DESCRIPTION: /desc = "recombinant hybrid DNA of 
IMMEDIATE SOURCE: 

CLONE: E. coli LC137/pPLMu . TGF-betal (44 /45 ) beta3 
FEATURE : 

NAME/KEY: mat_peptide 

LOCATION: 1 . . 132 

OTHER INFORMATION: /product= M N- terminal 44 amino 
acids of human TGF-betal M 
FEATURE: 

NAME/KEY: mat_peptide 
LOCATION: 133.. 336 

OTHER INFORMATION: /product = "C-terminal 68 amino 
acids of human TGF-beta3" 
FEATURE : 

NAME/ KEY: CDS 
LOCATION: 1 . .336 

OTHER INFORMATION: /product^ "hybrid TGF-beta named 
TGF-betal-3" 
SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
US-09-813-271B-7 

Query Match 16.8%; Score 227.2; DB 10; Length 336; 

Best Local Similarity 79.8%; Pred. No. 2.7e-51; 

Matches 268; Conservative 0; Mismatches 68; Indels 0; Gaps 0; 



Qy 8 88 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 94 7 



MINIM MMMM 1 1 1 1 1 1 1 1 1 II 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 60 

Qy 94 8 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 1007 

MMMIMMMI MMMM MMMMMMM M 1 1 MIMMMMMM 

Db 61 TACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 12 0 

Qy 1008 GCC^TTTCTGCCTGGGGCCCTGTCCCTAC^TCTGGAGCCTAGACACT(^GTAC^GC^AG 1067 

Mill MUM II II II II III M I M IIMM MUM I 

Db 121 GCC^CTTCTGCTCAGGCCCTTGCCCATACCTCra 18 0 

Qy 1068 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 112 7 

M MM MINIMI I INN I II N II INI 

Db 181 GTGCTGGGACTGTACAACACTCTGAACCCTGAAGCATCTGCCTCGCCTTGCTGCGTGCCC 24 0 

Qy 1128 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 118 7 

Nil MMMM III INN MM II II II INN MMMM' 

Db 241 CAGGACCTGGAGCCCCTGACCATCCTGTACTATGTTGGGAGGACCCCCAAAGTGGAGCAG 3 00 

Qy 1188 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGC 1223 

M IINIINI I III II II II II III 
Db 3 01 CTCTCCAACATGGTGGTGAAGTCTTGTAAATGTAGC 336 



Search completed: October 28, 2003, 09:04:34 
Job time : 409.294 sees 

GenCore version 5.1.6 
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OM nucleic - nucleic search, using sw model 



Run on: 



Title: 



October 27, 2003, 10:47:27 ; Search time 3357.54 Seconds 

( wi t hout a 1 ignment s ) 
9794.056 Million cell updates/sec 



US-10-017-372E-34 
Perfect score: 1353 

Sequence: 1 gatctggtaccgagatggcg cgattaaagcggccgcgact 1353 

Scoring table: I DENT I TY_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 22781392 seqs, 12152238056 residues 

Total number of hits satisfying chosen parameters: 45562784 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post -processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : EST:* 

1 : em estba : * 



2: em_esthum:* 

3 : em_est in : * 

4 : em_estmu : * 

5 : em_estov : * 

6: em_estpl:* 

7 : em_estro : * 

8: em_htc:* 

9: gb_estl:* 

10: gb_est2:* 

11: gb_htc:* 

12: gb_est3:* 

13: gb_est4:* 

14: gb_est5:* 

15: em_estfun:* 

16: em_estom:* 

17: em_gss_hum:* 

18: em_gss_inv:* 

19: em_gss_pln:* 

20: em_gss_vrt:* 

21: em_gss_fun:* 

22: em_gss_mam:* 

23: em_gss_mus : * 

24 : em_gss_pro : * 

25: em_gss_rod : * 

26: em_gss_phg:* 

27: em_gss_vrl:* 

28: gb_gssl:* 

29: gb_gss2:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 
BM562135 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



mRNA linear EST 2 0-FEB-2 0 02 
sapiens cDNA clone IMAGE : 5745463 



Craniata ; Vertebrata ; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



BM562135 1072 bp 

AGENCOURT_6562032 NIH_MGC_118 Homo 
5', mRNA sequence. 
BM562135 

BM56213 5 .1 GI : 18 8 07966 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata 
Mammalia; Eutheria; Primates 
1 (bases 1 to 1072) 
NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 

Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Life Technologies, Inc. 
cDNA Library Preparation: Life Technologies, Inc. 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 

found through the I.M.A.G.E. Consort ium/ LLNL at: 

http: //image. llnl .gov 

Plate: LLAM12768 row: h column: 08 



FEATURES 

source 



BASE COUNT 
ORIGIN 



High quality sequence start: 9 
High quality sequence stop: 647. 

Location/Qualifiers 

1. .1072 

/organism= "Homo sapiens" 
/ mo 1 _ t yp e = " mRNA " 
/db_xref="t axon: 9606" 
/clone=" IMAGE: 5745463" 
/ 1 i s sue_type= " 1 eukocy t e " 
/lab_host= n DH10B" 
/clone_lib= M NIH_MGC_118" 

/note= "Vector : pCMV-SPORT6; Site_l: Not I; Site_2 : EcoRV 
(destroyed) ; RNA source leukocytes from anonymous pool of 
non-activated adult donors. Library is oligo-dT primed 
and directionally cloned (EcoRV site is destroyed upon 
cloning). Average insert size 1.7 kb, insert size range 
1.2-3.3 kb. Library is normalized and enriched for 
full-length clones and was constructed by C. Gruber 
(Invitrogen) . Research Genetics tracking code 027. Note: 
this is a N I H_MGC Library." 
217 a 356 c 313 g 186 t 



Query Match 53.1%; Score 718.8; DB 12; Length 1072; 

Best Local Similarity 82.7%; Pred . No. 1.4e-146; 

Matches 888; Conservative 0; Mismatches 142; Indels 44; Gaps 4; 

Qy 131 GCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCT 190 

llllllllllllllllllllll lull I I . i I I I I , lllllllllll Mill 
Db 20 GCTGGTGAAGCGGAAGCGCATCTAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCT 7 9 

Qy 191 TGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGC 2 50 

: 1 1 1 1 1 II I M I ii 1 1 ill ii i iiiiiiiiiiiiiMiiiii mil ii M M 

Db 80 CGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGC 13 9 

Qy 251 TCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCC 310 

II MMMM 1 1 1 1 M 1 1 1 1 1 ! 1 1 MMMM MM I II M 1 1 1 1 1 M 1 1 II I 

Db 14 0 CCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCC 199 

Qy 311 AGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCA 3 70 

Mill I'l 1 1 1 1 1 : 1 1 M 1 1 1 1 M IM I II 1 : 1 1 1 II I III I 'I 1 1 1 I MM I 

Db 2 00 TGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGA 259 

Qy 371 AATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGA 43 0 

MMMM II MUM I I MIMI MMMM I MMMM II II 

Db 2 60 AATCTATGAC^GTTC^GCAGAGTACACACAGCATATATATGTTCTTCAACACATCAGA 319 

Qy 431 GCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAG 4 90 

MMM MMMM II MMI II MMMM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MMMM 

Db 320 GCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAG 379 

Qy 4 91 GCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTG 550 

I i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M I 

Db 380 G CTCAAGTTAAAAGTGGAG CAG CACGTGG AGCTGTACCAGAAATACAG CAACAATTCCTG 439 

Qy 551 GCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGA 610 

III III IIIIMMIIIIIIIIIII I Mill Mill II MMM I II Mill 



Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 



44 0 GCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGA 4 99 

611 TGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCT 67 0 

MIIIIMIIIMIIIMMMM! Ill Ml 1 1 1 1 1 II Mill MINIM 

500 TGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCT 55 9 



671 



560 



731 



620 



CAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTT 

1 1 MMMM MMMMMMI MIMMMM 1 1 MMI 1 1 MMMM 

TAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTT 



730 



619 



790 



CAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCT 

II I I MMMM 1 1 1 1 - 1 1 1 1 1 1 ( 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MMMM 

CACTACGGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCT 67 9 



791 CCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGC 8 50 

MMMMIMMMMMMMMIMMM IMM MMMMIMMMMMI 

68 0 TCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGAGC 73 9 
851 CCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTT 910 

MMMI MMMM MMI 

74 0 CC CTGGA CA C CAACTATTG CTT 761 

911 CAGCTGCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCT 970 

MMMI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I . I ! I ! 1 1 1 1 1 MINIUM I MUM 

762 CAGCTCCCCGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAGGGACCT 821 
971 GGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTG 103 0 

MMMIIMMM M M Ml I MINIMUM I II M II II MMMM 

822 CGGCTGGAAGTGGATCCACGAGCCCCAAGGCTACCATGCCCACTTCTGCCTCGGGCCCTG 881 
1031 TCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCA 10 90 

MMMM MMMM INN I lllllll MMI 1 1 1 1 1 1 1 II M 

GCCCTACATTTGGAGCCTGGACACGCGTACCAGCAAGTCCCTGGCCCTGTACACCCGCCA 



882 



941 



10 91 CAACCCGGGCGCGTCGGCGGCGCCGTGCTGC- -GTGCCGCAGGCGCTGGAGCCACTGCCC 114 8 

II II II I MINIMI I III I II II Mill II MINI 

942 TAACCGGGCGCCCTCGGCGGCGCGGGTTTGCTGGGCCCCAGGCCCCTGGACCCCCTGCCC 1001 
114 9 AT- -CGTGTACTACGTGGGCCGC- -AAGCCCAAGGTGGAGCAGCTGTCCAACAT 1198 

I I lllllll INI III I NMM III II MINI MM 

10 02 CTTCGGGGTACTACCTGGGGCGCCAAGCCCCAAGTGGGAACACCTGTCCCACAT 1055 



RESULT 2 

BX355682/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



BX355682 1201 bp mRNA linear EST 05-MAY-2003 

BX355682 Homo sapiens PLACENTA COT 25 -NORMALIZED Homo sapiens cDNA 
clone CSODI002YJ05 3 -PRIME, mRNA sequence. 
BX355682 

BX355682 . 1 GI :30371987 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 1201) 

Li,W.B., Gruber,C, Jessee,J. and Polayes,D. 



Craniata ; Vertebrata ; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



TITLE 
JOURNAL 
COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



Full-length cDNA libraries and normalization 

Unpublished 

Contact : Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 
Library was constructed by Life Technologies, a division of 
Invitrogen. This sequence belongs to sequence cluster 9160. r For 
more information about this cluster, see 
http ; //www .genoscope . ens . f r/ 

cgi -bin/cluster . cgi?seq=CSODI 0 02CE03NPl&clust er=9 16 0 . r . Contact : 
Feng Liang Email : fliang@lifetech.com URL : 

http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CSODI 002CE03NP1 . 

Location/Qual if iers 

1. .1201 

/organism="Homo sapiens" 
/ mo 1 _ t yp e = " mRNA " 
/ db_xr e f = " t a xon : 9 6 0 6 " 
/clone=" CSODI 002YJ05" 

/tiSSue_type= M PLACENTA COT 25 -NORMALIZED" 
/clone_lib="Homo sapiens PLACENTA COT 2 5 -NORMALIZED" 
/note="lst strand cDNA was primed with a Not I -oligo (dT) 
primer. Five prime end enriched, double-strand cDNA was 
digested with Not I and cloned into the Not I and EcoR V 
sites of the pCMVSPORT 6 vector. Library was normalized." 
198 a 326 c 386 g 253 t 38 others 



Query Match 52 . 7%; 

Best Local Similarity 83.0%; 
Matches 904; Conservative 



Score 713.6; DB 13; 
Pred. No. 2e-145; 
8; Mismatches 124; 



Length 12 01; 
Indels 53; Gaps 



7; 



Qy 264 ACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG - GCGGACTA 322 

I llllllllllll MUM I - MMMMMMMMI Ml MMIM 

Db 104 6 AACCGCGACCGGGTGGCCGGG - - AGAKKCAGAACCGGAGCCCGAGCCTGAGCCCGRACTA 98 9 



Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 



323 CTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATCTATGATAA 3 82 

MMMMMMMMMMMMMMMMMMM I MM MMMMM M 

98 8 CTACGCCAAGGAGGTMACCCGCGTGCTAATGGTGGAAACCCACAACGAAATCTATGACAA 92 9 
383 ATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGC 44 2 

MMM I I M I M I IMMMI I MMMM 1 1 MMMM MMI 

92 8 GTTCAAGCAGAGTACACACAGCATATATATGTTCTTCA^ 8 69 

44 3 GGTGCCGGAACCTGTATTGCTCT-CTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGTTAA 501 

Ml II Mill II MUM I 1 1 : 1 ' 1 1 1 1 1 ' 1 1 1 1 1 1 1 ! I j I M 1 1 1 1 1 1 

868 GGTACCTGAACCCGTGTTGCTCTCCCCGGGCAGAGCTGCGTCTGCTGAGGCTCAAGTTAA 8 09 
502 AAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGCTACCTCA 561 

IMMMMMMMIMMM MMMMMMMMI II II 1 1 1 II I 1 1 1 1 1 1 1 

808 AAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGATACCTCA 74 9 
562 GCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTCACCGGAG 621 

1 1 1 1 1 1 M 1 1 M 1 1 II Mill Mill II MMM I II 1 1 II I II 1 1 II 1 1 1 1 1 

74 8 GCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTCACCGGAG 68 9 



Qy 622 TTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCACT 681 

llllllllllllll III Ml III II II II II Illlllll II Illllll 

Db 688 TTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGA-GGCTTTCGCCTTAGCGCCCACT 630 

Qy 68 2 GTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAATTCTGGCC 741 

I illlllll III I I Illllll II Mill II llllllllll I I MM 

Db 629 GCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACTACCGGCC 570 

Qy 74 2 GCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGCCA 8 01 

MM MMMMMMMMMM MMMMMMM MMMM MMMMM 

Db 56 9 GCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGGCCA 510 

Qy 8 02 CCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTGGATACCA 861 

MM. MINI MINIMI Mill 1 1 1 1 II I II II II II I II II II II II 

Db 5 09 CCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGAGCCCTGGAC 4 54 

Qy 862 ACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGG 921 

MMMM IMIMMMIIMM 

Db 4 53 ACCAACTATTGCTTCAGCTCCACGG 42 9 

Qy 922 AGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGT 981 

Mill MMMMMMM MM MMMMMMM MMMM MMMMM 

Db 42 8 AGAAGNACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGT 3 69 

Qy 982 GGATTCATGAACCC^GGGCTAC(^TGC(^ATTTCTGCCTGGGGCCCTGTCCCTACATCT 1041 

MM II II MMMMMMMMMM MMMM MMMM MMMM I 

Db 368 GGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTT 3 09 

Qy 1042 GGAGCCTAGACACTCAGTAC^GO^AGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCG 1101 

Illllll Mill 1 1 1 1 1 i 1 1 1 1 E 1 1 1 1 1 1 1 1 1 MMMMMMM MMMMM 

Db 3 08 GGAGCCTGGAC^CGCAGTAC^GC^AGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCG 24 9 

Qy 1102 CGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACG 1161 

I 1 1 1 1 Ml MM III MM M MMMM M 1 1 M MM MIMMMMMMMM 

Db 24 8 CCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACG 18 9 

Qy 1162 TGGGCCGCAAGCCC^GGTGGAGC^GCTGTCC^CATGATCGTGCGTTCCTGCAAGTGCA 1221 

1 1 1 1 1 1 1 1 1 i E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i I i I i 1 1 1 1 j 1 1 1 1 1 1 f 1 1 1 

Db 188 TGGGCCGC^GCCCAAGGTGGAGC^GCTGTCCAACATGATCGTGCGCTCCTGO^AGTGCA 12 9 

Qy 1222 GCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGCCCCACCCCCGCC 1281 

Illllll 1 1 1 1 1 1 1 1 1 1 1 1 Illlllll Mill MM MM 1 1 

Db 128 GCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCRGCAGGCCCGGCCCCACCCCGCCCCGCCC 69 
Qy 12 82 CGCCT CACCGGGGCTGTATTTAAGGACATCGTGCCCCAAGCCCACTTGGGAT 1333 

I II II I 1 1 1 : 1 1 1 1 1 , 1 MM I MINIMUM 

Db 68 CCGCTGCCTTGCCCATGGGGGCTGTATTTAA-GACACCCGTCCCCAAGCCCACNNAMNNM 10 

Qy 1334 CGATTAAAG 1342 

= :|hlll = 
Db 9 MDATHAAAS 1 



RESULT 3 
BX335351/C 

LOCUS BX335351 983 bp mRNA linear EST 01-MAY-2003 

DEFINITION BX335351 Homo sapiens PLACENTA COT 25 -NORMALIZED Homo sapiens cDNA 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



clone CS0DI013YH16 3 - PRIME , mRNA sequence. 
BX335351 

BX335351 .1 GI : 30308367 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria ; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 983) 

Li , W . B . , Gruber , C . , Jessee , J . and Polayes , D . 
Full-length cDNA libraries and normalization 
Unpublished 
Contact : Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 
Library was constructed by Life Technologies, a division of 
Invitrogen. This sequence belongs to sequence cluster 9160. r For 
more information about this cluster, see 

http: //www. genoscope . ens . fr/cgi -bin/cluster . cgi?seq=CSODI 013DD08NP1 
&cluster=9160 . r . Contact : Feng Liang Email : fliang@lifetech.com 
URL : http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CSODI 013DD08NP1 . 

Location/Qualif iers 

1. .983 

/organ ism=" Homo sapiens" 
/mol_type= " mRNA 11 
/db_xref ="taxon: 9606" 
/clone=" CSODI 013 YH16" 

/tissue_type=" PLACENTA COT 25 -NORMALIZED" 
/clone_lib="Homo sapiens PLACENTA COT 2 5 -NORMALIZED" 
/note= ft lst strand cDNA was primed with a Not I -ol igo (dT) 
primer. Five prime end enriched, double-strand cDNA was 
digested with Not I and cloned into the Not I and EcoR V 
sites of the pCMVSPORT 6 vector. Library was normalized." 
170 a 280 c 323 g 203 t 7 others 



Query Match 51 . 9%; 

Best Local Similarity 84.5%; 
Matches 859; Conservative 



Score 702.6; DB 13; 
Pred. No. 4.7e-143; 
5; Mismatches 105; 



Length 983; 
Indels 47; 



Gaps 



5; 



Qy 



Db 



2 82 GGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAGGCGGACTACTACGCCAAGGAGGTCACC 341 

Mill 1 1 1 1 1 1! : 1 1 1 II 1 1 1 II i 1 1 Mill I M 1 1 1 1 M 1 1 1 N 1 1 1 1 1 1 1 1 1 1 

983 GGGG AGAGTG CAGAA C CGG AG C CCG AG C CTG AGG C CGACTACTA CGC CAAGG AGGT CAC C 924 



Qy 

Db 

Qy 
Db 

Qy 

Db 



342 



923 



402 



CGCGTGCTAATGGTGGAAAGCGG(^C(^^VTCTATGATAAATTCAAGGGCACCCCCCAC 

I I I I I I I I I I I I I I I I I : I I MM 1 1 1 1 1 1 M 1 1 II 1 1 1 1 1 1 I I III 

CGCGTGCTAATGGTGGARACCCACAACGAAATCTATGACAAGTTCAAGCAGAGTACACAC 



AGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTATTG 

III 1 1 1 1 1 1 1 1 I II I M II I II Ml MM II 1 1 M M II Mill II III 

8 63 AGCATATATATGTTCTTCAACACATCAGAGCTCCGAGAAGCGGTACCTGAACCCGTGTTG 



462 



803 



401 



864 



461 



804 



521 



CTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAG 

Mill I I I I I I I I I I I I I I I I I ! I I I I I I f I I I I I ! I I I I I I I I I I I I I M I I I I I I I 
CTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAG 744 



Qy 


522 


Db 


743 


Qy 


582 


Db 


683 


Qy 


642 


Db 


623 


Qy 


702 


Db 


564 


Qy 


762 


Db 


504 


Qy 


822 


Db 


444 


Qy 


882 


Db 


417 


Qy 


942 


Db 


363 


Qy 


1002 


Db 


303 


Qy 


1062 


Db 


243 


Qy 


1122 


Db 


185 


Qy 


1182 


Db 


125 


Qy 


1242 


Db 


66 



CTATACCAGAAATACAGCAATGATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCC 581 

II lllllllllllllllll 1 1 : 1 : 1 1 1 . 1 1 1! 1 1 II Ml 1 1 1 1 Ml 1 1 MM III 

CTGTACCAGAAATACAGCAACAATYCCTGGCGATACCTCAGCAACCGGCTGCTGGCACCC 684 



II Mill 1 1 MINI I II 1 1 1 1 1 1 M I ! I , II 1 1 1 i I M 1 1 i 1 1 1 1 1 III I 

AGCGACTCGCCAGAGTGGTTATCTTTTGATGTCACCGGAGTTGTGCGGCAGTGGTTGAGC 
CGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCACTGTTCCTGTGACAGCAAAGAT 

II III II II II II llllllll II MINIM IMMIMMMI III 

CGTGGAGGGGAAATTGA-GGCTTTCGCCTTAGCGCCCACTGCTCCTGTGACAGCAGGGAT 
AACACACTCCACGTGGAAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACC 

llllllll M Mill II llllllllll I I IIIIIIM MMMMMIMM 



ATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAG 821 

Mill I 1 MM llllllll II I M M.MM I II MM I III M I III 

ATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGGCCACCCCGCTGGAGAGGGCCCAG 44 5 



Mill llllllllllllllllll 



GACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGG 941 

llllllll llllllll I M MM I M II I Ml II 1 1 II I II Ml I II II III 

GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGG 364 



MINIMI IMIIMI I Mill Ml II M lllllllll 



TACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTAC 1061 

1 1 1 : 1 1 1 1 1 1 MiiiiM llllllll NNiiii llllllll inn him 

TACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTAM 244 
AGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGC 1121 

■■ Illlllllllll lllllllllllll Mill Mill lllllllllllll I I 

MMCAAGGTCCTGGCCMTGTACAACCAGCATAACCC-GGCGCCTCGGCGGCGCCGTNGT-C 186 

GTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTG 1181 

Illlllllllllllllllll I I I I I I I I J I f I I i I I I I I I I I I I llllllllllllll 
GTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCTGCAAGCCCAAGGTG 126 

GAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCC 1241 

MMMMMIMM MIIIIM 1 1 1 II II I II II II I II II I NININNI 

GAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGAGGTCCCGCCCCGCC- 6 7 
ACAGCCCCGCCCACCCGGCAGGCCCGGCCCCACCCCCGCCCGCCTCACCGGGGCTG 1297 

N I II MM 1 1 M M I M II 1 1 1 1 1 II MINI III II 

CCGCCCCGCCCCGGCAGGCCCGGCCCCACCCCGCCCCGCCCCCGCTGCCTTG 1 5 



RESULT 4 
BX349319 

LOCUS BX34 9319 



900 bp mRNA linear EST 05-MAY-2003 



DEFINITION BX34 9319 Homo sapiens B CELLS (RAMOS CELL LINE) COT 2 5- NORMALI Z ED 

Homo sapiens cDNA clone CS0DL010YL07 5 -PRIME, mRNA sequence. 
ACCESSION BX349319 

VERSION BX349319.1 GI -.30379410 

KEYWORDS EST . 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 90 0) 

AUTHORS Li,W.B., Gruber,C, Jessee,J. and Polayes,D. 
TITLE Full-length cDNA libraries and normalization 

JOURNAL Unpubl i shed 
COMMENT Contact: Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 
Library was constructed by Life Technologies, a division of 
Invitrogen. This sequence belongs to sequence cluster 9160. r For 
more information about this cluster, see 
http : //www. genoscope . ens . f r/ 

cgi -bin/cluster . cgi?seq=CSOBAG059ZD04_CS05596_l&cluster= 9 160 . r . 
Contact : Feng Liang Email : fliang@lifetech.com URL : 
http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CS0BAG059ZD04_CS05596_1 . 
FEATURES Location/Qualifiers 
source 1. .900 

/organ ism= "Homo sapiens" 

/mol_type="mRNA" 

/dbjeref ="taxon: 9606" 

/clone="CS0DL010YL07 M 

/cell_type="B CELLS (RAMOS CELL LINE) COT 2 5 -NORMALI ZED" 
/cell_line="RAM0S CELL LINE" 

/clone_lib="Homo sapiens B CELLS (RAMOS CELL LINE) COT 
25 -NORMALIZED" 

/note="lst strand cDNA was primed with a NotI -oligo (dT) 
primer. Five prime end enriched, double-strand cDNA was 
digested with Not I and cloned into the Not I and EcoR V 
sites of the pCMVSPORT 6 vector. Library was normalized." 

BASE COUNT 179 a 292 c 270 g 155 t 4 others 

ORIGIN 



Query Match 51.2%; Score 693.2; DB 13; Length 900; 

Best Local Similarity 87.8%; Pred . No. 5.1e-141; 

Matches 755; Conservative 0; Mismatches 105; Indels 0; Gaps 0, 
Qy 10 C CGAGATGG CG C CTT CGGGG CTG CGG CTCTTG C CG CTG CTGCTG CCG CTGCTGTGG CTGC 6 9 

II III MM II lllllllllll I M llllllllll MIMMMIIMI I 

Db 3 0 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 8 9 

Qy 7 0 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 9 

I I M M I M M I M M I II 1 1 1 M M I Mill 1 1 1 r 1 1 1 1 1 1 1 1 1 1 MMMMM 

Db 90 TGGTGCTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 14 9 



Qy 13 0 AG CTGGTGAAG CGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAG CTT CGG C 18 9 

1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 lllllllllll lllllllllll MM 

Db 150 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 2 09 



Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 



190 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 9 

I 1 1 1 1 1 ! 1 . 1 1 1 1 , 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 MMI II, 1 1 II Mill II II I 

210 TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 269 
25 0 CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 309 

I II MINIM 1 1 1 1 II 1 1 1 1 M 1 1 MINIM MM II I II 1 1 1 1 1 1 1 II 1 1 

27 0 CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 32 9 

310 CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 369 

I INN I I I M I I I I li I , I I I I I I II Ml II , I I III I I I I, I I I I I INI 
33 0 CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 389 

370 AAATCTATGATAAATTCAAGGGCACCGCCCACAGCTTATATATGCTGTTCAACACGTCGG 429 

lllllllll; II llllll I I MINI MINIM I 1 1 . 1 1 1 1 N I 

3 90 AAATCTATGAC^GTTC^GCAGAGTACACACAGCATATATATGTTCTTCAACACATCAG 44 9 
430 AG CTCCGGGAAG CGGTG CCGG AA G CTGTATTG CTCT CTCGGG CAGAG CTG CG G CTG CTGA 48 9 

1 1 1 1 1 1 1 I II INN II MINIM 1 1 1 1 1 1 1 1 1 1 1 f r I llllll 

450 AGCTGCGAGAAGCGGTACCTGAAGCCGTGTTGCTCTCGGGGGCAGAGCTGCGTCTGCTGA 509 

4 90 GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTGCT 54 9 

1 1 II 1 1 1 II 1 1 Ml I Ml III I II !l 1 1 1 II M I 1 1 II III I ! III 1 1 1 M MINI 

510 GG CT CAAGTTAAAAGTGGAG CAGCA GGTGGAG CTGTACCAGAAATA GAG CAA CAATTCCT 569 
550 GGCGCTACCTCAGCAACCGGCTGCTGGGCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 609 

MM 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 1 Mill INN N llllll I II INI 

570 GGCGATACGTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 629 
610 ATGTCACGGGAGTTGTGCGGCAGTGGCTGAGCCGCAGAGAGGCTATAGAGGGTTTTCGCC 669 

1 1 1 1 1 1 1 II I Ml 1 1 II I III 1 1 1 M Ml Ml III II II INN MM I 

63 0 ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGGCGTGGAGGGGAAATTGAGGGCTTTCGCC 68 9 
670 TCAGTGCCCACTGTTCCTGTGAGAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 72 9 

i ii NiMiii 1 1 : : 1 1 1 - : 1 1 1 1 i j 1 1 1 ; J ; 1 1 1 n inn m iiiini 

690 TTAG CG C C CA CTG CTC CTGTGACAG CAGGGATAA CA CA CTG CAAGTGGA CAT CAACGGGT 74 9 
730 TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 789 

III I I I ! 1 1 1 1 1 1 MM IMMIMIIIIII, NINNNINN II MM 

750 TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTTCTGC 809 

7 90 TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 9 

I U ! 1 1 II 1 1 II 1 1 I MM 1 1 II 1 1 1 II II I INN 1 1 1 . 1 1 1 1 1 1 1 ' 1 1 MM 

810 TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGNCGAG 869 

850 CCCTGGATACCAACGACTAC 8 69 

MINI I II I I I 

8 70 CCCTGGNACACCACTATTGC 88 9 



RESULT 5 

BX324511/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 



BX324511 1041 bp mRNA linear EST 02-MAY-2003 

BX324511 Homo sapiens NEUROBLASTOMA COT 2 5 -NORMALIZED Homo sapiens 
cDNA clone CS0DC024YD20 3 -PRIME, mRNA sequence. 
BX324511 

BX324511 . 1 GI :30332381 



KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chorda ta ; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 1041) 

Li,W.B., Gruber,C, Jessee,J. and Polayes,D. 
Full-length cDNA libraries and normalization 
Unpublished 
Contact : Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 
Library was constructed by Life Technologies, a division of 
Invitrogen. This sequence belongs to sequence cluster 9160. r For 
more information about this cluster, see 
http : / /www . genoscope . ens . f r/ 

cgi-bin/cluster . cgi?seq=CS0AC024DB10NP2&cluster=9160 . r . Contact : 
Feng Liang Email : fliang@lifetech.com URL : 

http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CS0AC024DB10NP2 . 

Location/Qualifiers 

1. .1041 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon: 9606" 
/clone=" CS0DC024YD2 0" 

/ 1 i s sue_type= " NEUROBLASTOMA COT 25 -NORMALIZED" 
/clone_lib="Homo sapiens NEUROBLASTOMA COT 2 5 -NORMALIZED" 
/note="lst strand cDNA was primed with a NotI -ol igo (dT) 
primer. Five prime end enriched, double-strand cDNA was 
digested with Not I and cloned into the Not I and EcoR V 
sites of the pCMVSPORT 6 vector. Library was normalized." 
176 a 283 c 346 g 219 t 17 others 



Query Match 48.9%; Score 661.2; DB 13; 

Best Local Similarity 81.5%; Pred. No. 5.2e-134; 
Matches 870; Conservative 10; Mismatches 133; Indels 



Length 1041; 

55; Gaps 



8; 



Qy 


283 


GGGAAAGTGTCGAACCGGAGCCCGAGCCAGAGGCGGACTACTACGCCAAGGAGGTCACCC 

Ml 1 :| IIIIIIIIIIIIM II :h h II 1 h IMIMI 
SGGGAGAKGSAGAACCGGAGCCCGACCCTRASCCR- - -ACTAYACCCCAAGRAGTCACCC 


342 


Db 


1022 


966 


Qy 


343 


GCGTGCTAATGGTGGAAAGCGGQUVCCAAATCTATGATAAATTCAAGGGC^CCCCCOVCA 

IIMMIIIIMI MM 1 MM MMMIMI II MIMI 1 1 MM 

GCGTGYTAATGKT-GAAACCCACAACGAAATCTATGAC^GTTC^GCAGAGTACAC^CA 


402 


Db 


965 


907 


Qy 


403 


GCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTATTGC 

M MMMM 1 IIIIMM 1 1 IMMMI IMMMI II Mill II MM 

GCATATATATGTTCTTCAACACATCAGAGCTCCGAGAAGCGGTACCTGAACCCGTGTTGC 


462 


Db 


906 


847 


Qy 


463 


TCTCT C - GGG CAG AG CTG CG CCTG CTGAGG CTCAAGTTAAAAGTGGAG CAG CACGTGG AG 

MM 1 1 1 1 1 1 1 1 1 11 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1! 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 

TCTCCCSGGGCAGAGCTGCGTCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAG 


521 


Db 


846 


787 



Qy 



522 CTATACCAGAAATACAGCAATGATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCC 581 



II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 MINIMI M MM II Mill I M 1 1 1 M N Ml 

Db 78 6 CTGTACCAGAAATACAGCAACAATTCCTGGCGATACCTCAGCAACCGGCTGCTGGCACCC 727 



Qy 582 AGTGACTCACCGGAGTGGCTGTCCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACC 641 

II Mill II llllll I II I Ml II 1 1 M MM I! I Ml II Ml 1 1 N Ml I 

Db 726 AGCGACTCGCCAGAGTGGTTATCTTTTGATGTCACCGGAGTTGTGCGGCAGTGGTTGAGC 667 



Qy 64 2 CGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCACTGTTCCTGTGACAGCAAAGAT 7 01 

II III II II II II Illlllll I I Illlllll I I I I I I I I I I I 1 IN 
Db 666 CGTGGAGGGGAAATTGA-GGCTTTCGCCTTAGCGCCG^CTGCTCCTGTGACAGCAGGGAT 608 



Qy 7 02 AACACACTCCACGTGGAAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACC 761 

Illlllll II Mill II I Illlllll I I I MINIM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 607 AACACACTGCAAGTGGACATCAACGGGTTCACTACCGGCCGCCGAGGTGACCTGGCCACC 54 8 



Qy 762 ATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAG 821 

INN MMMIMIMM Illlllll 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 54 7 ATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGGCCACCCCGCTGGAGAGGGCCCAG 4 8 8 



Qy 822 CACCTGCACAGCTCCCGGCACCGCCGAGCCCTGGATACCAACGACTACAAGGATGACGAC 881 

N INN 1 1 III II II 1 1 M II II Ml II II M 

Db 487 CATCTGCAAAGCTCCCGGCACCGCCGAGCCCTGGAC 452 



Qy 882 GACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGG 941 

Illlllll M I III 1 1 1 M MM I! II II II M I M II M I Ml 

Db 4 51 ACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGG 4 07 



Qy 942 CAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGC 1001 

INN MMMIMIMM Illlllll 1 1 1 1 1 1 1 1 1 N 1 1 1 N II 1 1 1 1 1 1 1 1 1 

Db 4 06 CAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGC 34 7 



Qy 10 02 TACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTAC 1061 

1 1 1 1 1 1 1 1 1 1 1 Illlllll Illlllll Illlllll Illlllll INN llllll 

Db 34 6 TACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTAC 287 



Qy 1062 AGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGC 1121 

MMMIMIMM MMMIMIMM INN INN II II 1 1 N II N II III 

Db 286 AGCAAGGTCCTGGCCCTGTACAACCAGCATAACCC-GGCGCCTCGGCGGCGCCGTG-TGC 22 9 



Qy 1122 GTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTG 1181 

1 1 ! 1 1 1 , 1 1 1 1 1 1 1 1 1 , 1 1 1 M Ml M 1 1 M I II MM MM M II 1 1 M I II M I Ml 

Db 228 GTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTG 169 



Qy 1182 GAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCC 1241 

M I M II II M II MM MM II I II N 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 llllllllllll 

Db 168 GAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCC 109 



Qy 124 2 ACAGCCCCGCCCACCCGGCAGGCCCGGCCCCACCCCCGCCCGCCT CACCGGG 12 93 

Illlllll INN INI INI IN II N IN 

Db 108 CGCCCCGCCCCGGCAGGCCCGGCCCCACCCCGCCCCGCCCCCGCTGCCTTGCCCATGGGG 4 9 



Qy 12 94 GCTG TATTTAAGGA CATCGTG C C C CAAG C C CA CTTGGGAT CGATTAAA 1341 

IINIIIIIIIINN I llllllllllll MM I : II 

Db 4 8 GCTGTATTTAAGGACACCCGTCCCCAAGCCCACCTGGSCACCCAYTAA 1 



RESULT 6 



BX434425/C 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



BX434425 888 bp mRNA linear EST 15-MAY-2003 

BX434425 Homo sapiens PLACENTA Homo sapiens cDNA clone CS0DE014YE16 
3 -PRIME, mRNA sequence. 
BX434425 

BX4 344 25. 1 GI : 307792 91 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 888) 

Li,W.B., Gruber,C, Jessee,J. and Polayes,D. 
Full-length cDNA libraries and normalization 
Unpublished 
Contact : Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 
Library was constructed by Life Technologies, a division of 
Invitrogen. This sequence belongs to sequence cluster 9160. r For 
more information about this cluster, see 
http : //www. genoscope . ens . f r/ 

cgi-bin/cluster . cgi?seq=CSOBAK028AB08NMl&cluster=9160 . r . Contact : 
Feng Liang Email : fliang@lifetech.com URL : 

http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CS0BAK028AB08NM1 . 

Location/Qualifiers 

1. .888 

/organism="Homo sapiens" 
/mol_type=" mRNA " 
/db_xref="taxon: 9606" 
/clone=" CS0DE014YE16" 
/ 1 i s sue_type= 11 PLACENTA 11 
/clone_lib="Homo sapiens PLACENTA" 

/note=" Vector : pCMVSP0RT_6; 1st strand cDNA was primed 
with a Not I -ol igo (dT) primer. Five prime end enriched, 
double-strand cDNA was digested with Not I and cloned into 
the Not I and EcoRV sites of the pCMVSPORT 6 vector. 
Library was not normalized." 
156 a 253 c 302 g 176 t 1 others 



Query Match 46.0%; Score 622.4; DB 13; Length 888; 

Best Local Similarity 83.6%; Pred. No. 1.5e-125; 

Matches 772; Conservative 0; Mismatches 102; Indels 49; Gaps 



4; 



Qy 
Db 

Qy 
Db 

Qy 



432 CTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGG 4 91 

1 1 1 1 Ml I II Mill I Ml I IMMMMMMI MIMM I 

888 CTCCAAAAAGCGGTACCTGAACCCTTGTTGTCTNCCCGGGCAGAGCTGCGTCTGCTGA-G 83 0 
4 92 CTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGG 551 

MMMM Ml 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 ] I M 1 1 1 1 1 1 1 1 MMMII 

82 9 CTCAAGTTTAAAATGGAGCAGCACGTGGAGCTGTACCAGAAATAC^GC^CAATTCCTGG 77 0 
552 CGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGAT 611 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 I III Mill II MMM I II 1 1 1 1 II 



Db 769 CGATACCTC^GCAACCGGCTGCTGGC^CACAGCGACTCGCCAGAGTGGTTATCTTTTGAT 710 

Qy 612 GTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTC 671 

IIIIMIMIIIIMIMMIIII 1 1 1 (II Ml II M Mill MM III 

Db 709 GTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTT 65 0 

Qy 672 AGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTC 731 

1 1 MMMM MMMMMMI MMMMMI 1 1 MMI 1 1 MMMMI 

Db 64 9 AGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGGVAGTGGACATCAACGGGTTC 5 9 0 

Qy 732 AATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTC 791 

I I I MMMM IMMIMMMIMMMI MMMMMMM MMMM 

Db 58 9 ACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTT 53 0 

Qy 792 CTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCC 8 51 

MMMMMMMMMMMMMMMM Mill 

Db 52 9 CTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA 4 73 

Qy 852 CTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTC 911 

MMMM MMMM MUM 

Db 472 GCCCTGGACACCAACTATTGCTTC 44 9 

Qy 912 AGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTG 971 

I I M M 1 1 M 1 1 M 1 1 1 M I M I M M I M M I M IMMMMMMI IIIIIIII 

Db 44 8 AGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTC 38 9 

Qy 972 GGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGT 1031 

MMMMMMM II II 1 1 II II I II I II II II M 1 1 MMMM MMMM 

Db 388 GGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGC 32 9 

Qy 1032 CCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCAC 1091 

MMMM IIIIIIII Mill MMMMIMMMIMM MMMMMMM 

Db 328 CCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGG 269 

Qy 10 92 AACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATC 1151 

MMMMMI 1 1 ! 1 1 1 1 1 1 f / 1 1 1 1 1 1 1 1 1 1 1 F 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 MMMMI 

Db 2 68 AACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATC 2 09 

Qy 1152 GTGTACTACGTGGGCCGC^GCCC^AGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCC 1211 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 
Db 2 08 GTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCC 14 9 

Qy 1212 TGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGCCC 1271 

I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MMMMMI IIIIIIII Mill III 

Db 14 8 TGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCCCACCC 8 9 
Qy 1272 CACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACA - TCGTGCCCCAAGC 1322 

I MM III M M 1 1 1 1 1 1 1 1 1 1 f I i 1 1 1 ! 1 1 MM 1 1 

Db 88 CGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCCGTGCCCCAAGC 2 9 

Qy 1323 CCACTTGGGATCGATTAAAGCGG 134 5 

MM MM I MMMI II 

Db 2 8 C CA C CTGGG C C C CA TTAAAGAGG 6 

RESULT 7 
BM555996 



LOCUS BM555996 1043 bp mRNA linear EST 20-FEB-2002 

DEFINITION AGENCOURT_65444 3 7 NIH_MGC_88 Homo sapiens cDNA clone IMAGE : 555003 9 

5', mRNA sequence. 
ACCESSION BM555996 

VERSION BM555996.1 GI : 18796907 

KEYWORDS EST . 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 1043) 

AUTHORS NIH-MGC http://mgc.nci.nih.gov/. 

TITLE National Institutes of Health, Mammalian Gene Collection (MGC) 

JOURNAL Unpubl i shed 
COMMENT Contact: Robert Strausberg, Ph.D. 

Email : cgapbs -r@mail .nih,gov 
Tissue Procurement: ATCC 
cDNA Library Preparation: Life Technologies, Inc. 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consort ium/LLNL at: 
http : / / image . llnl . gov 
Plate: LLAM12261 row: i column: 16 
High quality sequence stop: 702. 
FEATURES Locat ion/Qua 1 i f iers 

source 1. .1043 

/organism="Homo sapiens" 
/mol__type="mRNA" 
/db_xref= M taxon: 9606" 
/clone=" IMAGE: 555 003 9" 

/tissue_type= "duodenal adenocarcinoma, cell line" 
/lab_host = "DH10B (phage-resistant) 11 
/ c 1 one_l i b= " N I H_MGC_8 8 " 

/note=" Organ: small intestine; Vector: pCMV-SP0RT6; 
Site_l: Not I; Site_2: Sail; Cloned unidirect ionally ; 
oligo-dT primed. Average insert size 1.767 kb . Library 
enriched for full-length clones and constructed by Life 
Technologies. Note: this is a NIH_MGC Library." 

BASE COUNT 202 a 329 c 345 g 154 t 13 others 

ORIGIN 

Query Match 46.0%; Score 622.2; DB 12; Length 1043; 

Best Local Similarity 82.2%; Pred . No. 1.7e-125; 

Matches 736; Conservative 0; Mismatches 120; Indels 39; Gaps 1; 

Qy 4 00 ACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTAT 4 5 9 

Mill llllllll I llllllll II llllllll llllllll || Mill !! I 
Db 1 A(^GC^TATATATGTTCTT(^ACACATCAGAGCTCCGAGAAGCGGTACCTGAACCCGTGT 60 

Qy 4 60 TG CTCTCTCGGG CAG AG CTG CG C CTG CTG AGG CTCAAGTTAAAAGTGG AG CAG CACGTGG 519 

Illllll llllllllllllll Ml Mill I! II MINI; INI I IIIIIUI I III 

Db 61 TGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGG 12 0 

Qy 52 0 AGCTATACCAGAAATACAGCAATGATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCC 57 9 

INI Mill Ml llllll Ml II III II III II II III II II 1 1 III II Ml II I 

Db 121 AGCTGTACCAGAAATACAGCAACAATTCCTGGCGATACCTCAGCAACCGGCTGCTGGCAC 18 0 



Qy 


580 


CCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGA 

MM MMI M MMM 1 M M M 1 M M M M M M M M M M M M 1 Ml 

CCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTCACCGGAGTTGTGCGGCAGTGGTTGA 


639 


Db 


181 


240 


Qy 


640 


CCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCACTGTTCCTGTGACAGCAAAG 

Ml III II II Illll MMMM II MMMM lllllllllllll 1 

GCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGCGCCCACTGCTCCTGTGACAGCAGGG 


699 


Db 


241 


300 


Qy 


700 


ATAACACACTCCACGTGGAAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCA 

MINIMI, II Mill II Illlllllll 1 1 MMMM MIMMMIIM 

ATAACACACTGCAAGTGGACATCAACGGGTTCACTACCGGCCGCCGAGGTGACCTGGCCA 


759 


Db 


301 


360 


Qy 


760 


C CATT CA CGGCATGAACCGG C C CTT C CTG CT C CT CATGG CCA C C CCG CTGGAGAGGG C CC 

Illllll MMMMMMM llllllll 1 M II 1 1 1 1 1 II II 1 1 1 1 II 1 II II 1 1 1 

CCATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGGCCACCCCGCTGGAGAGGGCCC 


819 


Db 


361 


420 


Qy 


820 


AGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTGGATACCAACGACTACAAGGATGACG 

MM Mill MINIMUM! 

AGCATCTGCAAAGCTCCCGGCACCGCCGA 


879 


Db 


421 


449 


Qy 


880 


ACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGC 

llllllll MMMM 1 1 1 III II II 1 N MINI II II 1 N 1 NN 1 M 

GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGC 


939 


Db 


450 


501 


Qy 
Db 


940 
502 


GGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGG 

Illllll MMMMMMM llllllll 1 1 1 1 1 1 1 1 1 1 1 M 1 II II Illllll 

GGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGG 


999 
561 


Qy 


1000 


GCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGT 

lllllllllllll llllllll llllllll IMIIIII MMMM Illll MM 

GCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGT 


1059 


Db 


562 


621 


Qy 


1060 


ACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCT 

Illlllllllllllll MMMMMMM MIMIIIIII M II 1 1 II 1 1 1 1 1 1 1 1 

ACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCT 


1119 


Db 


622 


681 


Qy 


1120 


GCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGG 
i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i t i i i i i i i i i i i i i i i i i i i 

1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II M 1 M 1 1 1 1 1 1 1 1 1 II M M 1 M II 1 1 1 1 1 1 

GCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGG 


1179 


Db 


682 


741 


Qy 


1180 


TGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGC 

Mllllllllllllllllllllllllll Illllll Ml 1 1 III 

TGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAGTGCACCTTGAAGGGTCCGGCCCC 


1239 


Db 


742 


801 


Qy 


1240 


CCACAGCCCCGCCCACCCGGCAGGCCCGGCCCCACCCCCGCCCGCCTCACCGGGG 1294 




Db 


802 


III II III 1 M 1 II II 1 1 II 
CACNCGGGNCNNNNNGGNNNNGGGGGGNCCCCGGCGCCGGGCCCCCGCGGCCCGG 8 56 





RESULT 8 

BX383773/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 



BX383773 950 bp mRNA linear EST 08-MAY-2003 

BX383773 Homo sapiens HELA CELLS COT 2 5 -NORMALIZED Homo sapiens 
cDNA clone CS0DK001YA15 3-PRIME, mRNA sequence. 
BX383773 

BX383773 . 1 GI : 30457168 



KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



Chordata; Craniata; Vertebrata; Euteleostomi ; 
Primates; Catarrhini; Hominidae; Homo. 



BASE COUNT 
ORIGIN 



EST. 

Homo sapiens (human) 
Homo sapiens 
Eukaryota ; Metazoa ; 
Mammalia; Eutheria; 
1 (bases 1 to 950) 

Li,W.B., Gruber,C, Jessee,J. and Polayes,D. 
Full-length cDNA libraries and normalization 
Unpublished 
Contact : Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 
Library was constructed by Life Technologies, a division of 
Invitrogen. This sequence belongs to sequence cluster 9160. r For 
more information about this cluster, see 
http: //www. genoscope. ens . fr/ 

cgi -bin/cluster . cgi?seq=CS0DK0 01AA08NPl&cluster=9 160 . r . Contact : 
Feng Liang Email : f 1 iang^l if etech . com URL : 

http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CS0DK001AA08NP1 . 

Locat ion/Qua 1 if iers 

1. .950 

/organism="Homo sapiens " 
/mol_type= H mRNA n 
/db_xref ="taxon: 9606" 
/clone="CS0DK001YA15 H 

/cell_type="HELA CELLS COT 2 5 -NORMALI ZED" 
/cell_line="HELA" 

/clone_lib="Homo sapiens HELA CELLS COT 2 5 -NORMALIZED" 
/note="lst strand cDNA was primed with a Not I -ol igo (dT) 
primer. Five prime end enriched, double-strand cDNA was 
digested with Not I and cloned into the Not I and EcoR V 
sites of the pCMVSPORT 6 vector. Library was normalized." 
171 a 260 c 312 g 195 t 12 others 



Query Match 45.5%; Score 615.2; DB 13; Length 950; 

Best Local Similarity 80.4%; Pred. No. 5.5e-124; 

Matches 791; Conservative 8; Mismatches 134; Indels 51; 



Gaps 



5; 



Qy 



Db 



357 GAAAGCGGCAACCAAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTG 416 

MM I 1 1 1 1 MINIMI 1 1 MINI I : I IIMII MINIM I 

94 3 GRAACCCACAACGAAATCTATGACAAGTTCAAGCAGAGMA 8 84 



Qy 

Db 

Qy 
Db 

Qy 
Db 



417 TTCAACACGTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAG 4 76 

1 1 II h II 1 1 MMMM MIMMI 1 1 hill II llllllll MINIMI 

8 83 TTCAAMACATCAGAGCTCCGAGAAGCGGTACCTGVACCCGTGTTGCTCTCCCGGGCAGAG 824 



477 



536 



CTGCGCCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATAC 

Mill 1 1 Nil Ml I ! i I M 1 1 1 II 1 1 1 1 1 M Ml 1 1 II I II I II IIIIIMIIMI 

8 23 CTGCGTCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATAC 764 



537 



763 



AGCAATGATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAG 

1 1 1 1 1 1 1 MMIM M M MM I MM Ml 1 1 II 1 1 II Mill Mill 1 1 III 

AGC^C^TAMCTGGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAG 



596 



704 



Qy 


597 


TGGCTGTCCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATA 

III 1 II 1 ! ' 1 1 ' i : 1 . 1 i Ml IN III II II 

TGGTTATCTTTTGATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATT 


656 


Db 


703 


644 


Qy 


657 


GAGGGTTTTCGCCTCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACAGTCCACGTG 

II II llllllll II llllllll IIIIM IIIIM III IMIMI II III 
GA-GGCTTTCGCCTTAGCGCCCACTGCTCCTGTTACAGCAGGGATTACACACTGCAAGTG 


716 


Db 


643 


585 


Qy 


717 


GAAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAAC 


776 


Db 


584 


M M MMMM 1 1 1 MMMM MMMMMMMMMM MMMMI 

GACATCAACGGGTTTACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAAC 


525 


Qy 


111 


CGGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCC 

Mill MMMM M MIIIIIIIMi Ml M. MM Mill Mill IIIIM 

CGGCCTTTCCTGCTTCTCATGGCCACCCGGCTGGAGAGGGCCCAGCATCTGCAAAGCTCC 


836 


Db 


524 


465 


Qy 


837 


CGGCACCGCCGAGCCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGAT 

Illlllllllll MMMM 

CGGCACCGCCGA GCMCTGGAC 


896 


Db 


464 


444 


Qy 


897 


ACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGAC 


956 


Db 


443 


MMMM M II II II 1 II M 1 M 1 II II M 1 M II M 1 II M M M M MMMMI 

ACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTKAC 


384 


Qy 


957 


TTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTC 

Mill MMMM IIIIIIIIIIIMI II II IMIMIMMIIIMM 1 III 

TTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCGACTTC 


1016 


Db 


383 


324 


Qy 


1017 


TGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCT 

Mill MMMM MMMM MMMM Mill 1 M 1 II 1 1 1 1 1 1 M 1 1 1 II 1 

TGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCG 


1076 


Db 


323 


264 


Qy 


1077 


CTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTG 

1 II 1 II Mill MMMIMIMM IMMIIMMMMMI 

CCTGTACAACCAGCATAACCCGGCGCCTCGGCGGCGCCGTG- TGCGTGCCGCAGGCGCTG 


1136 


Db 


263 


205 


Qy 


1137 


GAGCC^CTGCCCATCGTGTACTACGTGGGCCGO^GCCCAAGGTGGAGCAGCTGTCCAAC 

Mill MMMMMMMIMMIMM MMIMMMIMMMMIIMM II 

GAGCCGCTGCCCATCGTGTACTACGTGGGCKKCAAGCCCAAGGTGGAGCAGCTGTCCNAC 


1196 


Db 


204 


145 


Qy 


1197 


ATGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACC 

1 1 1 1 M II M 1 MIMIMIIIIMMIII IMIMMIIM II 1 II 

ATGATCGTGCGCTCCTGCAAGTGCAGCTGAGATCCCGCCCCGCCCCGCCCCGCCCCGGAA 


1256 


DO 


144 


85 


Qy 


1257 


CGGCAGGCCCGGCCCCACCCCCGCCCGC CTCACCGGGGCTGTATTTAAGGA 

Mill III MMMM 1 1 II MMMIMIMM 1 

GGCCGCGGCCCCACCCCGCCCCGCCCCCSCTGCCTTGCCCATGGGGGCTGTATTTAAGNA 


1307 


Db 


84 


25 


Qy 


1308 


-CATCGTGCCCCAAGCCCACTTGG 133 0 




Db 


24 


1 1 II MMMMI III 

CCCCCCTGGCCCAAGCCCACCTGG 1 





RESULT 9 
AL530081 

LOCUS AL53 0081 859 bp mRNA linear EST 23-MAY-2 003 

DEFINITION AL530081 Homo sapiens NEUROBLASTOMA COT 50 -NORMALIZED Homo sapiens 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



CDNA clone CS0DD009YM06 5-PRIME, mRNA sequence. 
AL530081 

AL53 0081 .2 GI : 31067 916 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 859) 

Li,W.B., Gruber,C, Jessee,J. and Polayes,D. 
Full-length cDNA libraries and normalization 
Unpublished 

On Feb 13, 2001 this sequence version replaced gi: 12793574. 
Contact : Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 

was normalized. Library was constructed by Life Technologies, a 

division of Invitrogen. This sequence belongs to sequence cluster 

9160. r For more information about this cluster, see 

ht tp : // www . genoscope . ens . f r/ 

cgi -bin/cluster. cgi?seq=CS0DD009BG03QPl&cluster=916 0 . r . Contact : 
Feng Liang Email : fliang@lifetech.com URL : 

http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CSODD009BG03QP1 . 

Loca t ion/ Qual i f iers 

1. .859 

/organism="Homo sapiens" 
/mol_type=" mRNA" 
/db_xref ="taxon: 9606" 
/clone=" CS0DD009YM06" 

/ t is sue_type= "NEUROBLASTOMA COT 5 0 -NORMALIZED" 
/clone_lib="Homo sapiens NEUROBLASTOMA COT 5 0 -NORMALIZED" 
/note="lst strand cDNA was primed with a NotI -oligo (dT) 
primer. Five prime end enriched, double-strand cDNA was 
digested with Not I and cloned into the Not I and EcoR V 
sites of the pCMVSPORT 6 vector. Library was normalized." 
179 a 272 c 257 g 148 t 3 others 



Query Match 42 . 9%; 

Best Local Similarity 87.3%; 
Matches 645; Conservative 



Score 58 0.6; DB 9 
Pred. No. 1.9e-116 
2; Mismatches 91 



Length 8 59; 

Indels 1; Gaps 1; 



Qy 

Db 



1 0 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 6 9 

II III I I I I II llllllllill Ml lllllllll llllllllllllll I 
121 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCTGCTGCTGCTACCGCTGCTGTGGCTAC 18 0 



Qy 

Db 

Qy 
Db 



7 0 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 129 

I I II M 1 1 1 1 1 II I II 1 1 1 1 1 1 1 II, I Mill i 1 1 M I ! 1 1 1 1 1 1 1 MIMIMM 

181 TGGTG CTGACG CCTGG C CGG CCGG CCG CGGGACTAT C CAC CTG CAAG A CTATCGA CATGG 24 0 
13 0 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 9 

1 1 1 II 1 1 1 1 1 1 II M M 1 1 M M I III 1 1 1 llllillllll 1 1 1 1 f M 1 1 1 1 MM 

241 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 3 00 



Qy 



190 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 9 



Db 


301 


Qy 


250 


Db 


361 


Qy 


310 


Db 


421 


Qy 


370 


Db 


481 


Qy 


430 


Db 


541 


Qy 


490 


Db 


601 


Qy 


550 


Db 


661 


Qy 


609 


Db 


721 


Qy 


669 


Db 


781 


Qy 


729 


Db 


841 



I I II I i 1 1 1 1 1 , M 1 1 1 1 1 1 IIIIIIIMIIIIIIIIIII Mill II II I 

TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 36 0 
CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 09 

I II MINIM ill III 1 1 1 ' II , 1 1 1 1 1 1 1 1 MM : I ! 1 1 1 1 1 1 1 1 I f 1 1 

CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 42 0 
CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 369 

I Mill - I I I I 1 ^ I . I I ' I ' I I I I I i I I I ! : ! I 1 I I : I I I MM 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 4 8 0 
AAATCTATGATAAATTOVAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 42 9 

MMIIMM II MUM I I MINI MINIM I MINIM M I 

AAATCTATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAG 54 0 
AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 8 9 

NIMH I'M M II INN II 1 1 1 ! 1 1 1 r : . 1 1 1 ; I 1 1 1 1 1 1 MINN 

AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 6 0 0 
GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 9 

1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 : 1 1 1 ! 1 1 . 1 1 1 > 1 1 1 1 1 1 1 ! 1 1 1 i I [ 1 1 . 1 1 i MINI 

GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCT 660 
GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTG-GCTGTCCTTT 608 

INI M 1 1 1 1 1 II Ml M M 1 1 1 II M Mill Mill II III I I II IN 

GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGTGTTATCTTTT 72 0 
GATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGC 668 

1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 F 1 1 : 1 III IN III 1 1 M INN MINI 

GATGTCACCGGAGTTGTGCGGCAGTKGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGC 78 0 
CTCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGG 728 

II II I f 1 1 r 1 1 1 MIMIIIIIMI IIMIIINN II llhl 1 1 WWW 

CTTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGRACATCAACGGG 84 0 



III 



INN I 



RESULT 10 

BE312000 

LOGUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



BE312000 713 bp mRNA linear EST 26-OCT-2000 

601154768F1 NIH_MGC_19 Homo sapiens cDNA clone IMAGE: 3510592 5', 
mRNA sequence. 
BE312000 

BE312000.1 GI :9130128 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa,- Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia/ Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 713) 

NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 



FEATURES 

source 



BASE COUNT 
ORIGIN 



Email : cgapbs -r@mai 1 .nih.gov 

Tissue Procurement: ATCC 
cDNA Library Preparation: Ling Hong/Rubin Laboratory 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Incyte Genomics, Inc. 

Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consort ium/LLNL at: image.llnl.gov 
Plate: LLCM195 row: h column: 17 
High quality sequence start: 2 
High quality sequence stop: 713. 
Locat ion/Qua 1 i f iers 
1. .713 

/organism="Homo sapiens" 
/mol_type= " mRNA " 
/db_xref="taxon: 9606" 
/clone=" IMAGE: 3 510592" 
/tissue_type="neuroblastoma " 
/lab_host="DH10B (phage-resistant ) " 
/cl one_l ib= " NI H_MGC_1 9 " 

/note="0rgan: brain; Vector: pOTB7; Site_l : Xhol; Site_2 : 
EcoRI ; cDNA made by oligo-dT priming. Directionally 
cloned into EcoRI/XhoI sites using the following 5' 
adaptor: GGCACGAG(G). Library constructed by Ling Hong 
in the laboratory of Gerald M . Rubin (University of 
California, Berkeley) using ZAP-cDNA synthesis kit 
(Stratagene) and Superscript II RT (Life Technologies) . 
Note: this is a NIH_MGC Library." 
157 a 226 c 211 g 119 t 



Query Match 42 .4%; 

Best Local Similarity 87.8%; 
Matches 626; Conservative 



Score 573.8; DB 10; 
Pred. No. 5.5e-115; 
0; Mismatches 87; 



Length 713; 



Indels 



0; Gaps 



0; 



Qy 



Db 



139 AG CGG AAG CG CATCGAGG CCATTCGCGGC CAG ATTCTG T CCAAG CTTCGGCTTGC GAG C C 198 

IIMIIIL I'll, llllll I lllllllllll lllllllllll Mill lllllll 

1 AGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCCAGCC 60 



Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 



199 CCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTACA 

lllllilllMlllli II 1 1 II 1 1 1 II Ml 1 1 ' 1 1 ' Mill II II II II MM 

6 1 CCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTGTACA 



259 



258 



120 



318 



ACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAGGCGG 

MM llllllllllllll llllllll MM MMIIIIIIIMIIII Mill I 

121 ACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAGGCCG 18 0 
319 ACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATCTATG 378 

I II M 1 1 1 II II 1 1 M II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 II I I MM MMMMI 

181 ACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATCTATG 24 0 
379 ATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGG 43 8 

I II MUM I I MUM llllllll I IIIIMM II llllllll I 

241 ACAAGTTC^GCAGAGTACACACAGC^TATATATGTTCTTC^CAC^TCAGAGCTCCGAG 3 00 



Qy 439 AAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGT 4 98 

MIMM II Mill II IIIIMM llllllllllllll 1 1 1 M II I M 1 1 1 1 ll 

Db 3 01 AAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTCAAGT 360 



Qy 4 99 TAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGCTACC 558 

1 1 1 '1 1 1 , IN , 1 1 1 1 1 1 1 1 1 1 1 II I. II II 1 1 ' I II I M I 1 1 1 1 1 1 II 1 1 1 1 1 1 

Db 361 TAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGATACC 420 



Qy 559 TCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTCACCG 618 

IIMMMIIIMIIMM lllll IIIM II IMIII I II I II I M II II M I 

Db 421 TCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTCACCG 4 80 

Qy 619 GAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCC 678 

1 1 1 1 1 1 1 'IN 'I I U III III III II II lllll llllllll II 1 1 1 1 

Db 4 81 GAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGCGCCC 54 0 

Qy 67 9 AGTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAATTCTG 738 

I I II I I I II I . I i I I I I lll'll, II II lllll II llllllllll I I I 
Db 541 ACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACTACCG 600 

Qy 73 9 GCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAAGCGGCCCTTCCTGCTCCTCATGG 7 98 

Illllll MIMIMMMMMIMI IMIIIMIIMM llllllll lllllll 

Db 601 GCCGCCGAGGTGACGTGGCCAGCATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGG 660 

Qy 799 CCACGGCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCC 851 

I II . 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill lllllll I I MINI 

Db 661 CCACCCGGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCCGGCACCGCGAGCC 713 



RESULT 11 

BE260971 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BE260971 717 bp mRNA linear EST 26-OCT-2000 

601153715F1 NIH_MGC_19 Homo sapiens cDNA clone IMAGE : 3509 93 1 5*, 
mRNA sequence. 
BE260971 

BE260971.1 GI:9132709 
EST . 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hommidae; Homo. 

1 (bases 1 to 717) 

NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 

Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: ATCC 
cDNA Library Preparation: Ling Hong/Rubin Laboratory 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Incyte Genomics, Inc. 

Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consort ium/LLNL at: image.llnl.gov 
Plate: LLCM193 row: m column: 04 
High quality sequence stop: 713. 

Location/Qualifiers 

1. .717 

/organism="Homo sapiens" 
/ mol _ t yp e = " mRNA " 
/db_xref ="taxon: 9606" 



/ c 1 one =" IMAGE : 3509931" 
/tissue_type= "neuroblastoma " 
/lab_host="DH10B (phage-resistant ) " 
/cloneJLib="NIH_MGC__19 M 

/note="0rgan: brain; Vector: pOTB7; Site__l: Xhol; Site_2: 
EcoRI; cDNA made by oligo-dT priming. Direct ionally 
cloned into EcoRI /Xhol sites using the following 5' 
adaptor: GGCACGAG ( G ) . Library constructed by Ling Hong 
in the laboratory of Gerald M. Rubin (University of 
California, Berkeley) using ZAP-cDNA synthesis kit 
(Stratagene) and Superscript II RT (Life Technologies) . 
Note: this is a NIH_MGC Library." 

BASE COUNT 158 a 226 c 213 g ~~ 120 t 

ORIGIN 

Query Match 41.8%; Score 565.8; DB 10; Length 717; 

Best Local Similarity 87.7%; Pred. No. 3.1e-113; 

Matches 62 9; Conservative 0; Mismatches 87; Indels 1; Gaps 1; 

GCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAGCCC 199 

1 1 1 1 1 1 1 1 1 1 II N 1 1 MM 1 1 IMIIIIIIII llilll.llll Mill 1 1 1 1 1 1 1 1 

GCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCCAGCCC 6 0 
CCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTACAA 259 

MIMMMM Ml M II ! 1 1 1 II It 11 M 1 1 II Mill II II II II Mill 

CCCGAGCCAGG -GGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTGTACAA 119 
CAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAGGCGGA 319 

III MINIM MM; 1 1 1 1 ! 1 1 1 MM 1 1 1 1 1 1 1 1 1 II I II 1 1 1 Mill II 

CAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAGGCCGA 17 9 
CTACTACGCCTVAGGAGGTC^CCCGCGTGCTAATGGTGGAAAGCGGC^ACCAAATCTATGA 37 9 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATCTATGA 23 9 
TAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGA 43 9 

II MMM I I MIMI MINIM I I i 1 1 1 1 1 1 M 1.111,11 M 

CAAG TT CAAG CAGAG TA CA CA CAG CATATATATGTT CTT CAA CA CAT CAGAG CT C CG AGA 2 99 
AGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGTT 4 9 9 

MIMI II Mill II MINIM 1 1 M 1 11 1 1 1 M 1 1 IMMMMMMMM 

AG CGGTACCTGAACCCGTGTTGCTCTCCCGGG CAGAG CTGCGTCTGCTGAGGCTCAAGTT 35 9 
AAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGCTACCT 559 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MINIUM Mill 

AAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGATACCT 419 
CAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTCACCGG 619 

M Ml I IN I II 1 1 1 1 N Mill Mill II MIMI I M 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 



Qy 


140 


Db 


1 


Qy 


200 


Db 


61 


Qy 


260 


Db 


120 


Qy 


320 


Db 


180 


Qy 


380 


Db 


240 


Qy 


440 


Db 


300 


Qy 


500 


Db 


360 


Qy 


560 


Db 


420 


Qy 


620 


Db 


480 


Qy 


680 



II lllllllllllll IMIIIIIIII II Mill II I M 1 1 1 II 1 1 I I II 



Db 



54 0 CTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACTACCGG 599 



Qy 74 0 CCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGC 7 99 

MINI Ml I II I i II I ! ! 1 1 : i ! 1 1 MIIIIIIMMII IIMIIII IMIMII 

Db 600 CCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGGC 659 

Qy 8 00 CACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTGGA 856 

Illlllllllllllllllllllll Mill lllllllllllllll II Ml 

Db 66 0 CACC C CG CTGG AGAGGG C C GAG CATCTG CAAAG CT C C CGG CA CCG CGAG C C CTGGG A 716 



RESULT 12 
BQ675698 

LOCUS BQ675698 902 bp mRNA linear EST 15-JUL-2002 

DEFINITION AGENCOURT_8036532 NIH_MGC_102 Homo sapiens cDNA clone IMAGE : 62 1 1 9 17 

5', mRNA sequence. 
ACCESSION BQ67 5698 

VERSION BQ675698.1 GI:21786532 

KEYWORDS EST. 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 902) 

AUTHORS NIH-MGC http://mgc.nci.nih.gov/. 

TITLE National Institutes of Health, Mammalian Gene Collection (MGC) 

JOURNAL Unpubl i shed 
COMMENT Contact: Robert Strausberg, Ph.D. 

Email : cgapbs -r@mail .nih.gov 
Tissue Procurement: ATCC 
cDNA Library Preparation: Rubin Laboratory 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consort ium/LLNL at: 
http : // image. llnl .gov 
Plate: LLCM2375 row: c column: 22 
High quality sequence stop: 599. 
FEATURES Loca t ion/Qua 1 i f i ers 

source 1. .902 

/organism="Homo sapiens" 
/mol_type=" mRNA " 
/db_xref ="taxon: 9606" 
/ clone= " IMAGE : 6211917" 

/tissue_type= M epidermoid carcinoma, cell line" 
/lab_host="DH10B (phage-resistant ) " 
/cl one_l ib= "NI H_MGC_1 02 " 

/note="Organ: salivary gland; Vector: pOTB7; Site_l : Xhol; 
Site_2: EcoRI; cDNA made by oligo-dT priming. 
Directionally cloned into EcoRI/XhoI sites using the 
following 5' adaptor: GGCACGAG (G) . Library constructed 
by Ling Hong in the laboratory of Gerald M . Rubin 
(University of California, Berkeley) using ZAP-cDNA 
synthesis kit (Stratagene) and Superscript II RT (Life 
Technologies). Note: this is a NIH_MGC Library." 

BASE COUNT 194 a 292 c 255 g 161 t 

ORIGIN 



Query Match 41.7%; Score 564.8; DB 13; Length 902; 

Best Local Similarity 87.6%; Pred . No. 5.5e-113; 

Matches 617; Conservative 0; Mismatches 87; Indels 0; Gaps 0; 



Qy 


160 


TTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAGCCCCCCGAGCCAGGGGGACGTGC 


219 


Db 


1 


1 IMIIIIIMI MMIMIMI Mill 1 1 II II 1 1 1 II 1 1 1 II 1 1 1 II II MM 

TCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCCAGCCCCCCGAGCCAGGGGGAGGTGC 


60 


Qy 


220 


CGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTACAACAGTACCCGCGACCGGGTAG 

1 1 M 1 1 M M 1 1 1 M 1 1 M M 1 1 II II II II 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 

CGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTGTACAACAGCACCCGCGACCGGGTGG 


279 


Db 


61 


120 


Qy 


280 


CCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAGGCGGACTACTACGCCAAGGAGGTCA 

1 M 1 1 1 1 1 1 II 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 II 1 1 M II II II 1 
CCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAGGCCGACTACTACGCCAAGGAGGTCA 


339 


Db 


121 


180 


Qy 


340 


CCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATCTATGATAAATTCAAGGGCACCCCCC 

M II II II 1 II M II II M 1 1 1 MM MMMMM llllllll 1 II 

CCCGCGTGCTAATGGTGGAAACCCACAACGAAATCTATGACAAGTTCAAGCAGAGTACAC 


399 


Db 


181 


240 


Qy 


400 


ACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTAT 


459 


Db 


241 


Mill llllllll 1 1 1 II 1 1 II II 1 1 1 II 1 II 1 1 II II 1 1 II 1 1 II 1 II 1 
ACAGCATATATATGTTCTTCAACACATCAGAGCTCCGAGAAGCGGTACCTGAACCCGTGT 


300 


Qy 


460 


TGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGG 

Illllll 1 1 1 1 ] 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 M | | | | | | | | | | | | | | | | | | | | | | 

TGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGG 


519 


Db 


301 


360 


Qy 


520 


AGCTATACCAGAAATACAGCAATGATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCC 


579 


Db 


361 


MM MMMMIMMMM MMMMM II 1 II II II II II II II II II 1 1 1 

AGCTGTACCAGAAATACAGCAACAATTCCTGGCGATACCTCAGCAACCGGCTGCTGGCAC 


420 


Qy 


580 


CCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGA 

MM Mill M MUM 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! I Ml 

CCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTCACCGGAGTTGTGCGGCAGTGGTTGA 


639 


Db 


421 


480 


Qy 


640 


CCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCACTGTTCCTGTGACAGCAAAG 
iii iii ii ii i i i i i iiiiiiii ii iiiiiiii iiiiiiiiiiiii i 


699 


Db 


481 


Ml Ml II II II II 1 II 1 II II 1 II llllllll 1 1 II 1 II II 1 1 II 1 

GCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGCGCCCACTGCTCCTGTGACAGCAGGG 


540 


Qy 


700 


ATAACACACTCCACGTGGAAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCA 

1 1 II II 1 1 II II Mill M MMMMM 1 1 llllllll IMIIIIIIIIII 

ATAACACACTGCAAGTGGACATCAACGGGTTCACTACCGGCCGCCGAGGTGACCTGGCCA 


759 


Db 


541 


600 


Qy 


760 


CCATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCC 


819 


Db 


601 


Illllll llllllllllllll llllllll IIIMIIIMIIMIIMIMMIMM 

CCATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGGCCACCCCGCTGGAGAGGGCCC 


660 


Qy 


820 


AGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTGGATACCAAC 8 63 

MM MM MMMMIMMMIMMIMM 1 II 

AGCATCTGCCAAGCTCCCGGCACCGCCGAGCCCTGGGACACCAC 704 




Db 


661 





RESULT 13 
BI818841 

LOCUS BI818841 



925 bp mRNA linear EST 04-OCT-2001 



DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



603037307F1 NIH_MGC_115 Homo sapiens cDNA clone IMAGE : 51784 33 5', 

mRNA sequence. 

BI818841 

BI818841. 1 GI : 15930391 
EST . 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 925) 

NIH-MGC http://mgc.nci .nih.gov/ . 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 

Ema i 1 : cgapbs - r@ma i 1 . n ih . gov 

Tissue Procurement: Life Technologies, Inc. 

cDNA Library Preparation: Life Technologies, Inc. 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Incyte Genomics, Inc. 

Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consort ium/ LLNL at: 
http : //image . llnl .gov 
Plate: LLAM11444 row: n column: 02 
High quality sequence stop: 874. 
Location/Qualifiers 
1. .925 

/organism="Homo sapiens" 
/mol_type=" mRNA" 
/db_xref ="taxon: 9606" 
/clone=" IMAGE: 5178433" 
/labJiost="DH10B" 
/clone_lib="NIH_MGC_115" 

/note="Organ: pooled brain, lung, testis; Vector: 
pCMV-SPORT6; Site_l : Not I; Site__2: EcoRV (destroyed); RNA 
source anonymous pool of 6 male brains, age range 23-27; 1 
male lung, age 27; and 1 male testis, age 69. Library is 
oligo-dT primed and direct ionally cloned (EcoRV site is 
destroyed upon cloning) . Average insert size 1.8 kb, 
insert size range 1-3 kb. Library is normalized and 
enriched for full-length clones and was constructed by C. 
Gruber (Invitrogen) . Research Genetics tracking code 
021. Note: this is a NIH_MGC Library." 
170 a 316 c 278 g 161 t 



Query Match 39.7%; Score 537.6; DB 12; Length 925; 

Best Local Similarity 87.5%; Pred. No. 4.8e-107; 

Matches 611; Conservative 0; Mismatches 84; Indels 3: 



Qy 

Db 

Qy 

Db 



1 0 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 69 

II III I I I I II M I 1 I II II I 1 lllllllllllll llllllllllllll I 
146 CCCGCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 205 

7 0 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 129 

I I I I I I I I I I I , M I , I I I I I I I M i Mill llllllllllllll llllllllll 
206 TGGTGCTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 265 



Qy 


130 


AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 

IIIIIIIIIMIIIIIIIMIIIIIIIIIII III MM lllllllllll MM 

AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 


189 


Db 


266 


325 


Qy 


190 


TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 

1 MMMMMMMMMMMI MMMMMMMMMM MMI 1 1 M 1 

TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 


249 


Db 


326 


385 


Qy 


250 


CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 

1 1 1 MMMM MMMMMMM MMMM MM MMMMMMMM 

CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 


309 


Db 


386 


445 


Qy 


310 


CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 

1 MIM MMMMMMMMMMMIMMMMMMMMMM 1 MM 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCA.CAACG 


369 


Db 


446 


505 


Qy 


370 


AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 


429 


Db 


506 


MMMMM M MMM 1 1 MMM MMMM 1 MMMM M 1 

AAATCTATGACAAGTTCAAGGAGAGTACACA.CAG^ 


565 


Qy 


430 


AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 

MMIM MMMM M MMI M MMMM MMMMMMM MMMI 

AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 


489 


Db 


566 


625 


Qy 


490 


GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 

MMMMMMMMMMMMMMMMM MMMMMMMMI MMM 

GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCT 


549 


Db 


626 


685 


Qy 


550 


GGCGCTACCTCAGCAACCGGCTGC - TGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTT 


608 


nU 
DD 


C Q C 

bob 


MM MMMMMMMMMI MM MMI MMI II MMM 1 II III 

pp (~*(~* ATA PfTPA tv P rT 1 P HTT 1 PTTP P 1 P A PPPAP /~ipi 7\ PTPP P 1 P 1 A P 1 A pmpprpTA i-pP 1 1 " t "T" T ' 

UCjLLA I ALL 1 LACjLAALLLtOLT. LL I I bLLALLCJYCjLLACTLGLCAGAL I CjL I TA I LTTTT 


74b 


Qy 


609 


GATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGA - - GAGGCTATAGAGGGTTTTC 

MMMMMIMIMIIMMIMM Ml III MM II Mill MM 

GATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGACGGGGAACATTGAGGGCTTTC 


666 


Db 


746 


805 


Qy 


667 


GCCTCAGTGCCCACTGTTCCTGTGACAGCAAAGATAAC 704 

MM M MMMM 1 1 1 1 1 1 1 1 1 1 M 1 MM 1 

GCCTTAGCGCCCACTGCTCCTGTGACAGCAGGGATACC 84 3 




Db 


806 





RESULT 14 

AL530080/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



AL530080 841 bp mRNA linear EST 23-MAY-2003 

AL530080 Homo sapiens NEUROBLASTOMA COT 50 -NORMALIZED Homo sapiens 
cDNA clone CS0DD009YM06 3 -PRIME, mRNA sequence. 
AL530080 

AL53 008 0.2 GI : 31067 915 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 841) 

Li,W.B., Gruber,C, Jessee,J. and Polayes,D. 
Full-length cDNA libraries and normalization 
Unpublished 



COMMENT On Feb 13, 2001 this sequence version replaced gi: 12793573. 

Contact : Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 

was normalized. Library was constructed by Life Technologies, a 

division of Invitrogen. This sequence belongs to sequence cluster 

9160. r For more information about this cluster, see 

http : / /www . genoscope . ens . f r/ 

cgi-bin/cluster . cgi?seq=CSODD009BG03NPl&cluster=9160 . r . Contact -. 
Feng Liang Email : fliang@lifetech.com URL : 

http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CS0DD009BG03NP1 . 
FEATURES Location/Qualifiers 
source 1. .841 

/organism= M Homo sapiens" 
/ mo 1 __ t yp e = " mRNA " 
/db_xref ="taxon: 9606" 
/clone=" CS0DD00 9YM06" 

/tissue_type= "NEUROBLASTOMA COT 5 0 -NORMALIZED" 
/clone_lib="Homo sapiens NEUROBLASTOMA COT 5 0 -NORMALIZED" 
/note="lst strand cDNA was primed with a Notl-oligo (dT) 
primer. Five prime end enriched, double-strand cDNA was 
digested with Not I and cloned into the Not I and EcoR V 
sites of the pCMVSPORT 6 vector. Library was normalized." 

BASE COUNT 150 a 240 c 283 g 153 t 15 others 

ORIGIN 

Query Match 39.7%; Score 537; DB 9; Length 841; 

Best Local Similarity 81.4%; Pred. No. 6.3e-107 ; 

Matches 689; Conservative 8; Mismatches 104; Indels 45; Gaps 5; 

Qy 44 2 CGGTG CCGG AA C CTGTATTG CT CT - CTCGGG CAG AG CTG CG C CTG CTGAGG CT CAAGTTA 500 

I I I I I I hill I llhlll I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 841 CGGTACCTGMACCCTTGTTGYTCTCCCCGGGCAGAGCTGCTTCTGCTAAGGCTCAAGTTA 782 

Qy 501 AAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGCTACCTC 560 

II 1 1 1 1 1 1 1 1 1 1 1 1 1 II' 1 1 1 1 1 1 1 1 1 1 1 1 . 1 1 1 III II I lUIIMII Mill 

Db 781 AAAGTGGAGC^GCACGTGGAGCTGTACCAGAAATAC^GCAACAATWCCTGGCGATACCTC 722 

Qy 561 AGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTCACCGGA 62 0 

Mllllhll MINI Mill Mill II MIMI I II MM MMMMM 

Db 721 AGCAACCSGCSGCTGGCACCCAGCSACTCGCCAGAGTGGTTTTCTTTTGTTGTCACCGKA 662 

Qy 621 GTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCAC 68 0 

1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 III Ml III II II Ml 1 1 1 1 1 1 1 1 II Ml II 

Db 661 GTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTAAGG - -TTTCGCCTTAGCGCCNAC 604 

Qy 681 TGTTCCTGTGACAGCAAAGATAAC^CACTCCACGTGGAAATTAACGGGTTCAATTCTGGC 74 0 

I Ml MMM 1 1 1 1 1 1 1 1 ! 1 1 II Ml I II MMMM I I I III 

Db 603 GGCTCCTGTAACAGC^GGGATAACAC^CTGCAAGTGAACATa^ACGGGTTAACTACCGGC 544 

Qy 741 CGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGCC 8 00 

Mill Mlllllllllllll MM 1 1 1 1 1 1 1 J 1 1 1 1 J f MMMM IIIIIIMI 

Db 543 CGCCGAGGTGACCTGGCCACCATTCTTGGCATGAACCGGCCTTTCCTGCTTCTCATGGCC 484 



Qy 



801 ACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTGGATACC 860 



Db 


483 


ACCCCGCTGGAGAGGGCCCAGCTTCTGCAAAGCTCCCGGCACCGCCGA -~ 


436 


Qy 


861 


AACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACG 

MINIM MIMMI IMIIMIMMIII 

GCCCTGGACACCAACTATTGCTTCAGCTCCACG 


920 


Db 


435 


403 


Qy 


921 


GAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAG 

1 1 1 1 1 1 II 1 II 1 1 1 M II M II M M 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 
G AG AAG AACTG CTG CGTG CGG CAG CTGT ACATTAACTT CCG CAAGG AC CT CGG CTGGGAG 


980 


Db 


402 


343 


Qy 


981 


TGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATC 


1040 


Db 


342 


Mill M M MM MM MM II MINI MM MM MIMMI MMMM 

TGGATCCACGAGCCCAAGGGCTWCCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATT 


283 


Qy 


1041 


TGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGC 

MMMM II 1 1 1 Ml 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 II 1 M II II 1 1 1 1 1 

TGGAGCCTGGACACGCAGGACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGG-- 


1100 


Db 


282 


225 


Qy 


1101 


GCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTAC 

II 1 II 1 II 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 II 1 1 M II 1 1 
GCCTCGGCGGCGCCGTGC-GCGTGCCGCAGGCGCTGGAGCCGCTGCCCTTCGTGTACTAC 


1160 


Db 


224 


166 


Qy 


1161 


GTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGC 
i i i i i I i i i i i i i i i i i i i i i i ii i i i i i i i i i i i i i i i i i i i i i i i i i i i i i ii 

1 II II M 1 M II II II M M M M II II II 1 II II II II II M 1 II II II 1 II II 

GTGGGCCGCAAGCCCAAGGTGGGGCGGCTGTCCAACATGGTCGTGCGCTCCTGCAAGGGC 


1220 


Db 


165 


106 


Qy 


1221 


AGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGCCCCACCCCCGC 

MM II 1 1 1 1 J 1 1 1 1 J 1 1 II 1 II 1 II Mill MM MM 1 

CGCTGGGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCCCCCCCCGCCCCGCC 


1280 


Db 


105 


46 


Qy 


1281 


CCGCCT 12 8 6 

Ih 11 
CCKGCT 4 0 




Db 


45 





RESULT 15 

BI084718/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



BI084718 956 bp mRNA linear EST 20-JUN-2001 

602869722T1 NIH_MGC__102 Homo sapiens cDNA clone IMAGE : 5014421 3', 
mRNA sequence. 
BI084718 

BI084718 .1 GI : 14503048 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 956) 
NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 
Email : cgapbs -r@mai 1 .nih.gov 
Tissue Procurement: ATCC 

cDNA Library Preparation: CLONETECH Laboratories, Inc. 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 



Craniata ; Vertebrata / 
Catarrhini ; Hominidae ; 



Euteleostomi ; 
Homo. 



DNA Sequencing by: Incyte Genomics, Inc. 

Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consort ium/LLNL at: 
http://image.llnl .gov 
Plate: LLCM1821 row: 1 column: 06 
High quality sequence start: 3 
High quality sequence stop: 793. 
FEATURES Loca t ion/Qua 1 i f i ers 

source 1. .956 

/ organ ism=" Homo sapiens" 
/ mo 1 _ t yp e = " mRNA 11 
/db_xref ="taxon: 9606" 
/ clone= " IMAGE : 5014421" 

/tissue_type="epidermoid carcinoma, cell line" 
/lab_host="DH10B (phage-resistant ) " 
/ c 1 one_l ib= "NI H_MGC_1 02 " 

/note="Organ: salivary gland; Vector: pOTB7 ; Site__l : Xhol ; 
Site_2: EcoRI; cDNA made by oligo-dT priming. 
Directionally cloned into EcoRI/XhoI sites using the 
following 5' adaptor: GGCACGAG (G) . Library constructed 
by Ling Hong in the laboratory of Gerald M. Rubin 
(University of California, Berkeley) using ZAP-cDNA 
synthesis kit (Stratagene) and Superscript II RT (Life 
Technologies). Note: this is a NIH_MGC Library." 

BASE COUNT 172 a 273 c 312 g 199 t 

ORIGIN 

Query Match 3 9.7%; Score 536.6; DB 12; Length 956; 

Best Local Similarity 78.0%; Pred. No. 8e-107; 

Matches 757,- Conservative 0; Mismatches 159; Indels 55; Gaps 7; 
Qy 333 GAGGT(^CCCGCGTGCTAATGGTGGAAAGCGGC^CC7^TCTATGATAAATT(^AGGGC 3 92 

Mill I Ml 1 1 1 1 1 1 1 1 1 1 1 I I II II I MM I 

Db 955 GAGGTTAACCGGTGCTAAATGGTGGAAACCCACCACGAATCTATGACCAGGTTCAGCAGA 8 96 

Qy 3 93 ACCCCCCACAGCTTATATATGCTGTTCAACACGTC - - GGAGCTCCGGGAAGCGGTGCCGG 4 50 

II Ml MIM I MM M I MM III MM II 

Db 8 95 GTAC^C^CAGCC^TAAATATGGTCTTCAa^CATTCAGGAGCTTCCGAGAACCGGTACCCT 836 

Qy 4 51 AA C CTGTATTG CT CTCTCGGG CAGAG CTG CG C CTG CTGAGGCTCAAGTTAAAAGT 5 05 

I I M Ml II I II II I I I I I I I I I I I I I I I I I I I I f I 

Db 835 GAACCGTGTTGGCTCTTCCCGGGGAAGAGCTGCCGTCTGCCTGAGGCTCAAGTTAAAAGT 776 

Qy 5 06 -GGAGCAG(^CGTGGAGCTATAC(^GAAATACAGaUVTGATTCCTGGCGCTACCTCAGCA 564 

Ml I I I I I I I I I I I I I I III I I MIMMIII I I I I I I ' I I I 

Db 775 GGGACCAGCACGTGGAGCTGTAACCAGGAATAC^GC^CAATTCCTGGCGATACCTC^GC^ 716 

Qy 565 ACCGGCTGCTGGCCCCCAGTGA- - CTCACCGGAGTGGCTGTCCTTTGATGTCACCGGAGT 622 

I II 1 1 M I MM Mill M II MUM I II 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 

Db 715 ACCGGCTGGTGGCACCCAGCGAACTCGCCCAGAGTGGTTATCTTTTGATGTCACCGGAGT 656 

Qy 623 TGTGCGGCAGTGGCTGACCCGCAG-AGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCACT 681 

MMMMMMI Ml III I II II M Mill MMMM M MMMI 

Db 6 55 TGTGCGGCAGTGGTTGAGCCGTGGAAGGGGAAATTGAGGGCTTTCGCCTTAGCGCCCACT 5 96 

Qy 682 GTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTT^CGGGTTCAATTCTGGCC 741 

I MMMMMMI 1 1 II II M I II M Mill II 1 1 1 1 1 1 1 1 1 1 I I MM 



Db 595 GCTCCTGTGACAGCkGGGATAACAC^ 536 

Qy 74 2 GCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGCCA 801 

llll IMIIMMMIIIIIIMI IIIIMIIIIIIII Illlllll Mllllllll 

Db 53 5 GCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGGCCA 476 

Qy 802 CCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTGGATACCA 861 

MIIMIIIIIIIIIIIIIIII Mill I h I II 1 1 1 1 II I II I II 1 1! Ml 1 1 

Db 475 CCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGAGCCCTGGAC 42 0 

Qy 862 ACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGG 921 

Illlllll i , 1 1 1 1 i 1 1 1 1 1 M i :' 

Db 419 ACCAACTATTGCTTCAGCTCCACGG 395 



Qy 922 AGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGT 981 

I II II II I II M 1 1 M II I M I II I II M II II M 1 1 1 1 IMIIMI Mllllllll 

Db 394 AGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGT 33 5 



Qy 982 GGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCT 1041 

MM II M 1 1 M M 1 1 1 1 II 1 1 1 1 1 1 M Illlllll Illlllll IMIIMI I 

Db 334 GGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTT 275 

Qy 1042 GGAGCCTAGACACTC^GTACAG(^GGTCCTGGCTCTGTA(^C(^GC^C^ACCCGGGCG 1101 

Illllll Mill 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 IIIIMIIIIIIII Mllllllll 

Db 274 GGAGCCTGGAC^CGC^GTACAG(^GGTCCTGGCCCTGTACAACCAGCATAACCCGGGCG 215 



Qy 1102 CGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACG 1161 

I M II M II I M II M M M I M M M M M M II M II 1 1 M II 1 1 1 II I II M II I 

Db 214 CCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACG 155 

Qy 1162 TGGGCCGC^GCCCAAGGTGGAG^GCTGTCCAACATGATCGTGCGTTCCTGO^GTGCA 1221 

1 1 1 M 1 1 M I III M ! I II 1 1 1 1 1 1 1 . 1 1 1 II I II, i II Ml II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 154 TGGGCCGC^GCC(^GGTGGAGCAGCTGTC(^C^TGATCGTGCGCTCCTG(^GTGCA 95 



Qy 1222 GCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGCCCCACCCCCGCC 12 81 

MMiii iiiiiiiiiii ii i ii j 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 r 1 1 ii 

Db 94 GCTGAGGTCCCGCCCCGCC CCGCCCCGCCCCGGCAGGCCCGGCCCCACCCCGCCC 4 0 



Qy 1282 CGCCTCACCGG 12 92 

llll I I I 
Db 3 9 CGCCCCCGCTG 2 9 



Search completed: October 28, 2003, 00:08:25 
Job time : 3362.54 sees 
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